METHODS OF DIAGNOSIS OF OVARIAN CANCER, COMPOSITIONS AND
METHODS OF SCREENING FOR MODULATORS OF OVARIAN CANCER 



CROSS-REFERENCES TO RELATED APPLICATIONS
This application is related to USSN 60/299,234, filed June 18, 2001; USSN
60/315,287, filed August 27, 2001; USSN 60/317,544, filed September 5, 2001; USSN
60/350,666, filed November 13, 2001; and USSN 60/372,246, filed April 12, 2002, each of
which is incorporated herein by reference for all purposes.



FIELD OF THE INVENTION

The invention relates to the identification of nucleic acid and protein expression
profiles and nucleic acids, products, and antibodies thereto that are involved in ovarian
cancer; and to the use of such expression profiles and compositions in the diagnosis,
prognosis, and therapy of ovarian cancer. The invention further relates to methods for
identifying and using agents and/or targets that inhibit ovarian cancer.



BACKGROUND OF THE INVENTION 
Ovarian cancer is the sixth most common cancer in women, accounting for 6% of all
female cancers. It ranks fifth as the cause of cancer death in women. The American Cancer
Society predicts that there will be about 23,100 new cases of ovarian cancer in this country in
the year 2000 and about 14,000 women will die of the disease. Because many ovarian
cancers cannot be detected early in their development, they account for a disproportionate
number of fatal cancers, being responsible for almost half the deaths from cancer of the
female genital tract; more deaths than any other reproductive organ cancer.

Most patients with epithelial ovarian cancer, the predominant form, are asymptomatic
in early-stage disease and usually present with stage III or IV disease. Their five-year
survival is less than 25%, with lower survival among African-American women. The
minority of patients discovered with early-stage disease have a five-year survival rate of
80%-90%. See Parker, et al. (1997) "Cancer Statistics, 1997" CA Cancer J. Clin. 47:5-27.

In the absence of a family history of ovarian cancer, lifetime risk of ovarian cancer is
1/70. Risk factors include familial cancer syndromes (risk of up to 82% by age 70 in women
with hereditary breast/ovarian syndrome); family history (1.4% lifetime risk with no affected
relatives, 5% with one affected relative, 7% with two affected relatives; Kerlikowske, et al.
(1992) Obstet. Gynecol. 80:700-707); nulliparity; advancing age; obesity; personal history of
breast, endometrial, or colorectal cancer; fewer pregnancies; or older age (>35 years) at first
pregnancy. However, 95% of all ovarian cancers occur in women without risk factors. Use
of hormonal contraceptives, oophorectomy, and tubal sterilization reduce risk of ovarian
cancer (Kerlikowske, et al. (1992) Obstet. Gynecol. 80:700-707; Grimes (1992) Am. J.
Obstet. Gynecol. 166:1950-1954; Hankinson, et al. (1993) JAMA 270:2813-2818); however,
even bilateral oophorectomy may not be completely effective in preventing ovarian cancer.

Treatment of ovarian cancer consists largely of surgical oophorectomy, anti-hormone
therapy, and/or chemotherapy. Although many ovarian cancer patients are effectively
treated, the current therapies can all induce serious side effects which diminish quality of life.
Deciding on a particular course of treatment is typically based on a variety of prognostic
parameters and markers (Fitzgibbons, et al. (2000) Arch. Pathol. Lab. Med. 124:966-978;
Hamilton and Piccart (2000) Ann. Oncol. 11:647-663), including genetic predisposition
markers BRCA-1 and BRCA-2 (Robson (2000) J. Clin. Oncol. 18:113sup-118sup).

The identification of novel therapeutic targets and diagnostic markers is essential for
improving the current treatment of ovarian cancer patients. Recent advances in molecular
medicine have increased the interest in tumor-specific cell surface antigens that could serve
as targets for various immunotherapeutic or small molecule strategies. Antigens suitable for
immunotherapeutic strategies should be highly expressed in cancer tissues and ideally not
expressed in normal adult tissues. Expression in tissues that are dispensable for life,
however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell
antigen CD20. Humanized monoclonal antibodies directed to Her2/neu
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer.
Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal
antibodies (Rituxan®/rituximab) are used to effectively treat non-Hodgkin's lymphoma.
Maloney, et al. (1997) Blood 90:2188-2195; Leget and Czuczman (1998) Curr. Opin. Oncol.
10:548-551.

Potential immunotherapeutic targets have been identified for ovarian cancer. One
such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane protein,
present at the apical surface of glandular epithelial cells. It is often overexpressed in ovarian
cancer, and typically exhibits an altered glycosylation pattern, resulting in an antigenically
distinct molecule, and is in early clinical trials as a vaccine target. Gilewski, et al. (2000)
Clin. Cancer Res. 6:1693-1701; Scholl, et al. (2000) J. Immunother. 23:570-580. The
tumor-expressed protein is often cleaved into the circulation, where it is detectable as the
tumor marker, CA 15-3. See, e.g., Bon, et al. (1997) Clin. Chem. 43:585-593. However, many
patients have tumors that express neither HER2 nor MUC-1; therefore, it is clear that other
targets need to be identified to manage localized and metastatic disease.

Mutations in both BRCA1 and BRCA2 are associated with increased susceptibility to
ovarian cancer. Mutations in BRCA1 occur in approximately 5 percent (95 percent
confidence interval, 3 to 8 percent) of women in whom ovarian cancer is diagnosed before
the age of 70 years. See Stratton, et al. (1997) N. Engl. J. Med. 336:1125-1130. And, in BRCA1
gene carriers, the risk for developing ovarian cancer is 0.63. See Easton (1995) Am. J. Hum.
Genet. 56:267-xxx; and Elit (2001) Can. Fam. Physician 47:778-84.

Other biochemical markers such as CA125 have been reported to be associated with
ovarian cancer, but they are not absolute indicators of disease. Although roughly 85% of
women with clinically apparent ovarian cancer have increased levels of CA125, CA125 is
also increased during the first trimester of pregnancy, during menstruation, in the presence of
non-cancerous illnesses, and in cancers of other sites.

While industry and academia have identified novel gene sequences, there has not been
an equal effort exerted to identify the function of these novel sequences. The elucidation of a
role for novel proteins and compounds in disease states for identification of therapeutic
targets and diagnostic markers is essential for improving the current treatment of ovarian
cancer patients. Accordingly, provided herein are molecular targets for therapeutic
intervention in ovarian and other cancers. Additionally, provided herein are methods that can
be used in diagnosis and prognosis of ovarian cancer. Further provided are methods that can
be used to screen candidate bioactive agents for the ability to modulate ovarian cancer.



SUMMARY OF THE INVENTION
The present invention therefore provides nucleotide sequences of genes that are up-
and down-regulated in ovarian cancer cells. Such genes are useful for diagnostic purposes,
and also as targets for screening for therapeutic compounds that modulate ovarian cancer,
such as hormones or antibodies. The methods of detecting nucleic acids of the invention or
their encoded proteins can be used for many purposes, e.g., early detection of ovarian
cancers, monitoring and early detection of relapse following treatment, monitoring response
to therapy, selecting patients for postoperative chemotherapy or radiation therapy, selecting
therapy, determining tumor prognosis, treatment, or response to treatment (of primary or
metastatic tumors), and early detection of pre-cancerous lesions. Other aspects of the
invention will become apparent to the skilled artisan by the following description of the
invention.

In one aspect, the present invention provides a method of detecting an ovarian
cancer-associated transcript in a cell from a patient, the method comprising contacting a
biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-26.

In one embodiment, the present invention provides a method of determining the level 
of an ovarian cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting an ovarian
cancer-associated transcript in a cell from a patient, the method comprising contacting a
biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-26.

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least
95% identical to a sequence as shown in Tables 1-26.

In one embodiment, the biological sample is a tissue sample. In another embodiment,
the biological sample comprises isolated nucleic acids, e.g., mRNA.

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. 
In one embodiment, the polynucleotide is immobilized on a solid surface. 
In one embodiment, the patient is undergoing a therapeutic regimen to treat ovarian
cancer. In another embodiment, the patient is suspected of having metastatic ovarian cancer.
In one embodiment, the patient is a human.

In one embodiment, the ovarian cancer associated transcript is mRNA.

In one embodiment, the method further comprises the step of amplifying nucleic acids
before the step of contacting the biological sample with the polynucleotide.

In another aspect, the present invention provides a method of monitoring the efficacy
of a therapeutic treatment of ovarian cancer, the method comprising the steps of: (i) providing
a biological sample from a patient undergoing the therapeutic treatment; and (ii) determining
the level of an ovarian cancer-associated transcript in the biological sample by contacting the
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80%
identical to a sequence as shown in Tables 1-26, thereby monitoring the efficacy of the
therapy. In a further embodiment, the patient has metastatic ovarian cancer. In a further
embodiment, the patient has a drug resistant form of ovarian cancer.

In one embodiment, the method further comprises the step of: (iii) comparing the
level of the ovarian cancer-associated transcript to a level of the ovarian cancer-associated
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic
treatment.

Additionally, provided herein is a method of evaluating the effect of a candidate
ovarian cancer drug comprising administering the drug to a patient and removing a cell
sample from the patient. The expression profile of the cell is then determined. This method
may further comprise comparing the expression profile to an expression profile of a healthy
individual. In a preferred embodiment, said expression profile includes a gene of Tables
1-26.

In one aspect, the present invention provides an isolated nucleic acid molecule
consisting of a polynucleotide sequence as shown in Tables 1-26.

In one embodiment, an expression vector or cell comprises the isolated nucleic acid.

In one aspect, the present invention provides an isolated polypeptide which is encoded
by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-26.

In another aspect, the present invention provides an antibody that specifically binds to
an isolated polypeptide which is encoded by a nucleic acid molecule having polynucleotide
sequence as shown in Tables 1-26.

In one embodiment, the antibody is conjugated to an effector component, e.g., a
fluorescent label, a radioisotope or a cytotoxic chemical.

In one embodiment, the antibody is an antibody fragment. In another embodiment,
the antibody is humanized.

In one aspect, the present invention provides a method of detecting an ovarian cancer
cell in a biological sample from a patient, the method comprising contacting the biological
sample with an antibody as described herein.

In another aspect, the present invention provides a method of detecting antibodies
specific to ovarian cancer in a patient, the method comprising contacting a biological sample
from the patient with a polypeptide encoded by a nucleic acid comprising a sequence from
Tables 1-26.

In another aspect, the present invention provides a method for identifying a compound
that modulates an ovarian cancer-associated polypeptide, the method comprising the steps of:
(i) contacting the compound with an ovarian cancer-associated polypeptide, the polypeptide
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical
to a sequence as shown in Tables 1-26; and (ii) determining the functional effect of the
compound upon the polypeptide.

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or cell
membrane. In another embodiment, the polypeptide is recombinant.

In one embodiment, the functional effect is determined by measuring ligand binding
to the polypeptide.

In another aspect, the present invention provides a method of inhibiting proliferation
of an ovarian cancer-associated cell to treat ovarian cancer in a patient, the method
comprising the step of administering to the subject a therapeutically effective amount of a
compound identified as described herein.

In one embodiment, the compound is an antibody.

In another aspect, the present invention provides a drug screening assay comprising the steps
of: (i) administering a test compound to a mammal having ovarian cancer or to a cell sample
isolated therefrom; (ii) comparing the level of gene expression of a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-26 in a
treated cell or mammal with the level of gene expression of the polynucleotide in a control
cell sample or mammal, wherein a test compound that modulates the level of expression of
the polynucleotide is a candidate for the treatment of ovarian cancer.

In one embodiment, the control is a mammal with ovarian cancer or a cell sample that
has not been treated with the test compound. In another embodiment, the control is a normal
cell or mammal, or is non-malignant tissue.

In one embodiment, the test compound is administered in varying amounts or
concentrations. In another embodiment, the test compound is administered for varying time
periods. In another embodiment, the comparison can occur after addition or removal of the
drug candidate.

WO 02/102235    PCT/US02/19297

In one embodiment, the levels of a plurality of polynucleotides that selectively
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-26 are
individually compared to their respective levels in a control cell sample or mammal. In a
preferred embodiment the plurality of polynucleotides is from three to ten.

In another aspect, the present invention provides a method for treating a mammal
having ovarian cancer comprising administering a compound identified by the assay
described herein.

In another aspect, the present invention provides a pharmaceutical composition for treating a 
mammal having ovarian cancer, the composition comprising a compound identified by the 
assay described herein and a physiologically acceptable excipient. 

In one aspect, the present invention provides a method of screening drug candidates
by providing a cell expressing a gene that is up- or down-regulated as in an ovarian cancer.
In one embodiment, a gene is selected from Tables 1-26. The method further includes
adding a drug candidate to the cell and determining the effect of the drug candidate on the
expression of the expression profile gene.

In one embodiment, the method of screening drug candidates includes comparing the
level of expression in the absence of the drug candidate to the level of expression in the
presence of the drug candidate, wherein the concentration of the drug candidate can vary
when present, and wherein the comparison can occur after addition or removal of the drug
candidate. In a preferred embodiment, the cell expresses at least two expression profile
genes. The profile genes may show an increase or decrease.

Also provided is a method of evaluating the effect of a candidate ovarian cancer drug
comprising administering the drug to a transgenic animal expressing or over-expressing the
ovarian cancer modulatory protein, or an animal lacking the ovarian cancer modulatory
protein, for example as a result of a gene knockout.

Moreover, provided herein is a biochip comprising one or more nucleic acid segments
of Tables 1-26, wherein the biochip comprises fewer than 1000 nucleic acid probes.
Preferably, at least two nucleic acid segments are included. More preferably, at least three
nucleic acid segments are included.

Furthermore, a method of diagnosing a disorder associated with ovarian cancer is
provided. The method comprises determining the expression of a gene of Tables 1-26 in a
first tissue type of a first individual, and comparing the distribution to the expression of the
gene from a second normal tissue type from the first individual or a second unaffected
individual. A difference in the expression indicates that the first individual has a disorder
associated with ovarian cancer.

In a further embodiment, the biochip also includes a polynucleotide sequence of a 
gene that is not up- and down-regulated in ovarian cancer. 

In one embodiment, a method is provided for screening for a bioactive agent capable of
interfering with the binding of an ovarian cancer modulating protein (ovarian cancer modulatory
protein) or a fragment thereof and an antibody which binds to said ovarian cancer modulatory
protein or fragment thereof. In a preferred embodiment, the method comprises combining an ovarian
cancer modulatory protein or fragment thereof, a candidate bioactive agent and an antibody
which binds to said ovarian cancer modulatory protein or fragment thereof. The method
further includes determining the binding of said ovarian cancer modulatory protein or
fragment thereof and said antibody. Where there is a change in binding, an agent is
identified as an interfering agent. The interfering agent can be an agonist or an antagonist.
Preferably, the agent inhibits ovarian cancer.

Also provided herein are methods of eliciting an immune response in an individual.
In one embodiment a method provided herein comprises administering to an individual a
composition comprising an ovarian cancer modulating protein, or a fragment thereof. In
another embodiment, the protein is encoded by a nucleic acid selected from those of Tables
1-26.

Further provided herein are compositions capable of eliciting an immune response in
an individual. In one embodiment, a composition provided herein comprises an ovarian
cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-26 or a fragment
thereof, and a pharmaceutically acceptable carrier. In another embodiment, said composition
comprises a nucleic acid comprising a sequence encoding an ovarian cancer modulating
protein, preferably selected from the nucleic acids of Tables 1-26, and a pharmaceutically
acceptable carrier.

Also provided are methods of neutralizing the effect of an ovarian cancer protein, or a
fragment thereof, comprising contacting an agent specific for said protein with said protein in
an amount sufficient to effect neutralization. In another embodiment, the protein is encoded
by a nucleic acid selected from those of Tables 1-26.

In another aspect of the invention, a method of treating an individual for ovarian
cancer is provided. In one embodiment, the method comprises administering to said
individual an inhibitor of an ovarian cancer modulating protein. In another embodiment, the
method comprises administering to a patient having ovarian cancer an antibody to an ovarian
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can
be a cytotoxic agent or a radioisotope.



DETAILED DESCRIPTION OF THE INVENTION
In accordance with the objects outlined above, the present invention provides novel
methods for diagnosis and prognosis evaluation for ovarian cancer (OC), including metastatic
ovarian cancer, as well as methods for screening for compositions which modulate ovarian
cancer. Also provided are methods for treating ovarian cancer and related conditions, e.g.,
ovarian carcinoma (e.g., epithelial (including malignant serous tumors, malignant mucinous
tumors, and malignant endometrioid tumors), germ cell (including teratomas,
choriocarcinomas, polyembryomas, embryonal carcinoma, endodermal sinus tumor,
dysgerminoma, and gonadoblastoma), and stromal carcinomas (e.g., granulosal stromal cell
tumors)), fallopian tube carcinoma, and peritoneal carcinoma.

Tables 1-26 provide unigene cluster identification numbers for the nucleotide 
sequence of genes that exhibit increased or decreased expression in ovarian cancer samples. 
Tables 1-26 also provide an exemplar accession number that provides a nucleotide sequence 
that is part of the unigene cluster. 


Definitions 

The term "ovarian cancer protein" or "ovarian cancer polynucleotide" or "ovarian
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants,
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%,
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater nucleotide
sequence identity, preferably over a region of at least about 25, 50, 100, 200,
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of
Tables 1-26; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a
gene of Tables 1-26, and conservatively modified variants thereof; (3) specifically hybridize
under stringent hybridization conditions to a nucleic acid sequence, or the complement
thereof, of Tables 1-26 and conservatively modified variants thereof; or (4) have an amino
acid sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%,
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or greater
amino acid sequence identity, preferably over a region of at least about 25, 50,
100, 200, 500, 1000, or more amino acids, to an amino acid sequence encoded by a nucleotide
sequence of or associated with a gene of Tables 1-26. A polynucleotide or polypeptide
sequence is typically from a mammal including, but not limited to, primate, e.g., human;
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. An "ovarian
cancer polypeptide" and an "ovarian cancer polynucleotide" include both naturally occurring
and recombinant forms.

A "full length" ovarian cancer protein or nucleic acid refers to an ovarian cancer
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements
normally contained in one or more naturally occurring, wild type ovarian cancer
polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various
stages of post-translation processing or splicing, including alternative splicing.

"Biological sample" as used herein is a sample of biological tissue or fluid that
contains nucleic acids or polypeptides, e.g., of an ovarian cancer protein, polynucleotide or
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g.,
humans, or rodents, e.g., mice and rats. Biological samples may also include sections of
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes,
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also
include explants and primary and/or transformed cell cultures derived from patient tissues. A
biological sample is typically obtained from a eukaryotic organism, most preferably a
mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea
pig, rat, mouse; rabbit; or a bird; reptile; or fish. Livestock and domestic animals are of
particular interest.

"Providing a biological sample" means to obtain a biological sample for use in
methods described in this invention. Most often, this will be done by removing a sample of
cells from an animal, but can also be accomplished by using previously isolated cells (e.g.,
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will
be particularly useful.

The terms "identical" or percent "identity," in the context of two or more nucleic
acids or polypeptide sequences, refer to two or more sequences or subsequences that are the
same or have a specified percentage of amino acid residues or nucleotides that are the same
(e.g., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%,
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and
aligned for maximum correspondence over a comparison window or designated region) as
measured using a BLAST or BLAST 2.0 sequence comparison algorithm with default
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to
be "substantially identical." This definition also refers to, or may be applied to, the
complement of a test sequence. The definition also includes sequences that have deletions
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g.,
polymorphic or allelic variants, and man-made variants. As described below, the preferred
algorithms can account for gaps and the like. Preferably, identity exists over a region that is
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is
50-100 amino acids or nucleotides in length.

For sequence comparison, typically one sequence acts as a reference sequence, to
which test sequences are compared. When using a sequence comparison algorithm, test and
reference sequences are entered into a computer, subsequence coordinates are designated, if
necessary, and sequence algorithm program parameters are designated. Preferably, default
program parameters can be used, or alternative parameters can be designated. The sequence
comparison algorithm then calculates the percent sequence identities for the test sequences
relative to the reference sequence, based on the program parameters.
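
The percent identity calculation described above can be illustrated with a minimal Python sketch. It assumes the test and reference sequences have already been aligned (gaps written as "-") and that identity is counted over all aligned columns, gaps included; both the function name `percent_identity` and that denominator convention are assumptions made for illustration, not a definition taken from this specification.

```python
def percent_identity(aligned_test: str, aligned_ref: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumption: gaps are written as '-' and the denominator is the number
    of aligned columns (columns where both rows are gaps are ignored).
    """
    if len(aligned_test) != len(aligned_ref):
        raise ValueError("aligned sequences must have equal length")
    # Pair up the columns, dropping any column that is a gap in both rows.
    columns = [
        (a, b)
        for a, b in zip(aligned_test.upper(), aligned_ref.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    # A column counts as identical only when both residues match and are not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical example sequences, not taken from Tables 1-26.
    print(round(percent_identity("ACGT-A", "ACGTTA"), 1))
```

Under this convention a test sequence would meet an "at least 80% identical" threshold when the returned value is 80.0 or greater over the designated region.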

A "comparison window", as used herein, includes reference to a segment of any one of the
number of contiguous positions selected from the group consisting typically of from 20 to
600, usually about 50 to about 200, more usually about 100 to about 150, in which a sequence
may be compared to a reference sequence of the same number of contiguous positions after
the two sequences are optimally aligned. Methods of alignment of sequences for comparison
are well-known in the art. Optimal alignment of sequences for comparison can be conducted,
e.g., by the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math.
2:482-489, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol.
Biol. 48:443-453, by the search for similarity method of Pearson and Lipman (1988) Proc.
Nat'l. Acad. Sci. USA 85:2444-2448, by computerized implementations of these algorithms
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package,
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and
visual inspection (see, e.g., Ausubel, et al. (eds. 1995 and supplements) Current Protocols in
Molecular Biology, Lippincott).
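
As an illustration of the local homology approach cited above, the following Python sketch computes a Smith-Waterman local alignment score. It is a simplified teaching example, not the GAP, BESTFIT, or FASTA implementation: the scoring values (match = 2, mismatch = -1, linear gap = -2) and the function name `smith_waterman_score` are assumptions chosen for illustration only.

```python
def smith_waterman_score(seq_a: str, seq_b: str,
                         match: int = 2, mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score between seq_a and seq_b.

    Assumptions: simple match/mismatch scoring and a linear gap penalty.
    Only the score is returned; traceback is omitted for brevity.
    """
    cols = len(seq_b) + 1
    prev = [0] * cols          # previous row of the dynamic-programming matrix
    best = 0
    for i in range(1, len(seq_a) + 1):
        curr = [0] * cols
        for j in range(1, cols):
            diag = prev[j - 1] + (match if seq_a[i - 1] == seq_b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # Hypothetical sequences sharing the local region "ACG".
    print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

The Needleman and Wunsch global method differs mainly in that cell values are not floored at zero and the score is read from the final cell rather than from the matrix maximum.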

Preferred examples of algorithms that are suitable for determining percent sequence
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are
described in Altschul, et al. (1997) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990)
J. Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described
herein, to determine percent sequence identity for the nucleic acids and proteins of the
invention. Software for performing BLAST analyses is publicly available through the
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short
words of length W in the query sequence, which either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence. T is
referred to as the neighborhood word score threshold (Altschul, et al., supra). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing
them. The word hits are extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g.,
for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word
hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences)
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a
comparison of both strands. For amino acid sequences, the BLASTP program uses as
defaults a word length of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix
(see Henikoff and Henikoff (1992) Proc. Nat'l Acad. Sci. USA 89:10915-10919) alignments (B)
of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
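
The word-hit extension and its three stopping conditions can be sketched in Python as follows. This is a simplified, ungapped, one-directional illustration and not the NCBI BLAST implementation: the function name `extend_hit_right`, the default drop-off value `x_drop=20`, and the example sequences are assumptions, while the reward M=5 and penalty N=-4 follow the nucleotide defaults stated above.

```python
def extend_hit_right(query: str, subject: str, q_start: int, s_start: int,
                     word_len: int, reward: int = 5, penalty: int = -4,
                     x_drop: int = 20) -> tuple[int, int]:
    """Extend a seed word hit to the right without gaps.

    Returns (best cumulative score, length of the best-scoring segment).
    Extension halts when the score falls off by x_drop from its maximum,
    when the score reaches zero or below, or when either sequence ends.
    """
    if q_start + word_len > len(query) or s_start + word_len > len(subject):
        raise ValueError("seed word runs past the end of a sequence")
    # Score the seed word itself.
    score = sum(
        reward if query[q_start + k] == subject[s_start + k] else penalty
        for k in range(word_len)
    )
    best_score, best_len = score, word_len
    q, s = q_start + word_len, s_start + word_len
    while q < len(query) and s < len(subject):      # stop at end of either sequence
        score += reward if query[q] == subject[s] else penalty
        if score > best_score:
            best_score, best_len = score, q - q_start + 1
        elif best_score - score >= x_drop or score <= 0:
            break                                   # X drop-off or non-positive score
        q += 1
        s += 1
    return best_score, best_len


if __name__ == "__main__":
    # Hypothetical sequences: eight matching positions followed by mismatches.
    print(extend_hit_right("ACGTACGTAAAA", "ACGTACGTCCCC", 0, 0, 4, x_drop=10))
```

A full implementation would apply the same rule to the left of the seed as well and combine the two extensions into one HSP; larger values of X allow the extension to cross longer mismatched stretches at the cost of speed.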

The BLAST algorithm also performs a statistical analysis of the similarity between
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA 90:5873-
5887). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid
is considered similar to a reference sequence if the smallest sum probability in a comparison
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01, and most preferably less than about 0.001. Log values may be large 
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.

An indication that two nucleic acid sequences or polypeptides are substantially
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another
indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells, such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than the protein
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means 
removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogenous, e.g., 100% pure.

The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which
one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing
modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well
as amino acid analogs and amino acid mimetics that function similarly to the naturally
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to
chemical compounds that have a structure that is different from the general chemical
structure of an amino acid, but that function similarly to a naturally occurring amino acid.

Amino acids may be referred to herein by either their commonly known three letter 
symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic acid 
sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 
most proteins. For instance, the codons GCA, GCC, GCG, and GCU all encode the amino
acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can
be altered to another of the corresponding codons described without altering the encoded
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid sequence herein which encodes a 
polypeptide also describes silent variations of the nucleic acid. In certain contexts each 
codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a
functionally identical molecule. Accordingly, a silent variation of a nucleic acid which
encodes a polypeptide is implicit in a described sequence with respect to the expression
product, but not necessarily with respect to actual probe sequences.
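
The notion of a silent variation can be illustrated with a short sketch. It assumes the standard genetic code written in the RNA alphabet (U rather than T); the names CODON_TABLE, synonymous_codons, and translate are hypothetical and chosen only for this example.

```python
from itertools import product

# Standard genetic code, codons enumerated in U, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def synonymous_codons(codon):
    """Return every codon that encodes the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def translate(rna):
    """Translate an RNA coding sequence codon by codon (trailing bases ignored)."""
    usable = len(rna) - len(rna) % 3
    return "".join(CODON_TABLE[rna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(synonymous_codons("GCA"))   # the four alanine codons
    print(synonymous_codons("AUG"))   # methionine has a single codon
    # Swapping GCA for GCC is a silent variation: the peptide is unchanged.
    print(translate("AUGGCAUGG"), translate("AUGGCCUGG"))
```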

As to amino acid sequences, one of skill will recognize that individual substitutions,
deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which
alters, adds, or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration results in the 
substitution of an amino acid with a chemically similar amino acid. Conservative substitution
tables providing functionally similar amino acids are well known in the art. Such 
conservatively modified variants are in addition to and do not exclude polymorphic variants, 
interspecies homologs, and alleles of the invention. Typical conservative substitutions for
one another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3)
Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L),
Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine 
(S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton (1984) 
Proteins Freeman). 
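
The eight groups listed above lend themselves to a simple lookup. The following is a minimal sketch, assuming one-letter amino acid codes; the names CONSERVATIVE_GROUPS and is_conservative are hypothetical. Note that methionine appears in two groups, as it does in the list above.

```python
# The eight groups of mutually conservative substitutions, one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1) Alanine, Glycine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7) Serine, Threonine
    set("CM"),    # 8) Cysteine, Methionine
]


def is_conservative(original, replacement):
    """True if the two different residues share at least one group."""
    if original == replacement:
        return False  # not a substitution at all
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    for pair in [("D", "E"), ("M", "C"), ("M", "L"), ("C", "L")]:
        print(pair, is_conservative(*pair))
```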

Macromolecular structures such as polypeptide structures can be described in terms of
various levels of organization. For a general discussion of this organization, see, e.g.,
Alberts, et al. (2001) Molecular Biology of the Cell (4th ed.) Garland Pub.; and Cantor and
Schimmel (1980) Biophysical Chemistry Part I: The Conformation of Biological
Macromolecules Freeman. "Primary structure" refers to the amino acid sequence of a 
particular peptide. "Secondary structure" refers to locally ordered, three dimensional
structures within a polypeptide. These structures are commonly known as domains.
Domains are portions of a polypeptide that often form a compact unit of the polypeptide and
are typically 25 to approximately 500 amino acids long. Typical domains are made up of
sections of lesser organization such as stretches of β-sheet and α-helices. "Tertiary structure"
refers to the complete three dimensional structure of a polypeptide monomer. "Quaternary
structure" refers to the three dimensional structure formed, usually by the non-covalent
association of independent tertiary units. Anisotropic terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents 
used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50, or more nucleotides in length, up
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers of any
length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000,
etc. A nucleic acid of the present invention will generally contain phosphodiester bonds,
although in some cases, nucleic acid analogs are included that may have at least one different
linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or
O-methylphosphoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A
Practical Approach Oxford University Press); and peptide nucleic acid backbones and 
linkages. Other analog nucleic acids include those with positive backbones; non-ionic
backbones, and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7 of Sanghvi and Cook (eds. 1994) 
Carbohydrate Modifications in Antisense Research ACS Symposium Series 580. Nucleic
acids containing one or more carbocyclic sugars are also included within one definition of
nucleic acids. Modifications of the ribose-phosphate backbone may be done for a variety of
reasons, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and 
analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures 
of naturally occurring nucleic acids and analogs may be made. 

A variety of references disclose such nucleic acid analogs, including, e.g.,
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-1963 and references therein;
Letsinger (1970) J. Org. Chem. 35:3800-3803; Sprinzl, et al. (1977) Eur. J. Biochem. 81:579-
589; Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett.
805; Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-4471; and Pauwels, et al. (1986)
Chemica Scripta 26:141-149), phosphorothioate (Mag, et al. (1991) Nucl. Acids Res.
19:1437-441; and U.S. Patent No. 5,644,048), phosphorodithioate (Brill, et al. (1989) J. Am.
Chem. Soc. 111:2321-2322), O-methylphosphoroamidite linkages (see Eckstein (1992)
Oligonucleotides and Analogues: A Practical Approach Oxford Univ. Press), and peptide
nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. Soc. 114:1895-897;
Meier, et al. (1992) Angew. Chem. Int. Ed. Engl. 31:1008-1010; Nielsen (1993) Nature
365:566-568; Carlsson, et al. (1996) Nature 380:207, each of which is incorporated by
reference). Other analog nucleic acids include those with positive backbones (Denpcy, et al.
(1995) Proc. Nat'l Acad. Sci. USA 92:6097-101); non-ionic backbones (U.S. Patent Nos.
5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863; Kiedrowshi, et al. (1991) Angew.
Chem. Intl. Ed. English 30:423-426; Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-
4471; Jung, et al. (1994) Nucleoside and Nucleotide 13:1597; Chapters 2 and 3, in Sanghvi
and Cook (eds. 1994) Carbohydrate Modifications in Antisense Research ACS Symposium
Series 580; Mesmaeker, et al. (1994) Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs,
et al. (1994) J. Biomolecular NMR 34:17-xx; Horn, et al. (1996) Tetrahedron Lett. 37:743-
xxx) and non-ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and
5,034,506, and Chapters 6 and 7, in Sanghvi and Cook (eds. 1994) Carbohydrate
Modifications in Antisense Research ACS Symposium Series 580. Nucleic acids containing
one or more carbocyclic sugars are also included within one definition of nucleic acids (see
Jenkins, et al. (1995) Chem. Soc. Rev. pp 169-176). Several nucleic acid analogs are
described in Rawls (p. 35 June 2, 1997) C&E News. Each of these references is hereby
expressly incorporated by reference. 

Particularly preferred are peptide nucleic acids (PNA) which include peptide nucleic
acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched base pairs. DNA and RNA typically exhibit a 2-4° C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C.
Similarly, due to their non-ionic nature, hybridization of the bases attached to these 
backbones is relatively insensitive to salt concentration. In addition, PNAs are not degraded 
by cellular enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. As will be appreciated by those
in the art, the depiction of a single strand also defines the sequence of the complementary 
strand; thus the sequences described herein also provide the complement of the sequence. 
The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the
nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations
of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine,
hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally
occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes
non-naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside. 
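
Because the depiction of a single strand also defines its complement, the complementary strand can be derived mechanically. The following is a minimal sketch for unmodified DNA written with the bases A, C, G, and T only; the function name reverse_complement is hypothetical, and modified bases such as inosine are outside its scope.

```python
# Watson-Crick pairing for the four standard DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the complementary strand, written 5' to 3'."""
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return dna.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
```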

A "label" or a "detectable moiety" is a composition detectable by spectroscopic,
photochemical, biochemical, immunochemical, chemical, or other physical means. For
example, useful labels include 32P, fluorescent dyes, electron-dense reagents, enzymes (e.g.,
as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins or other entities
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to
detect antibodies specifically reactive with the peptide. The labels may be incorporated into
the ovarian cancer nucleic acids, proteins and antibodies at any position. Any method known 
in the art for conjugating the antibody to the label may be employed, including those methods 
described by Hunter, et al. (1962) Nature 144:945-xxx; David, et al. (1974) Biochemistry 
13:1014-1021; Pain, et al. (1981) J. Immunol. Meth. 40:219-230; and Nygren (1982) J.
Histochem. and Cytochem. 30:407-412.

An "effector" or "effector moiety" or "effector component" is a molecule that is 
bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
non-covalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an 
antibody. The "effector" can be a variety of molecules including, e.g., detection moieties
including radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such
as epitope tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an 
antibiotic; or a radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or non-covalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of
binding to a target nucleic acid of complementary sequence through one or more types of
chemical bonds, usually through complementary base pairing, usually through hydrogen bond
formation. As used herein, a probe may include natural (e.g., A, G, C, or T) or modified
bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a 
linkage other than a phosphodiester bond, so long as it does not functionally interfere with
hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent bases
are joined by peptide bonds rather than phosphodiester linkages. Probes may bind target
sequences lacking complete complementarity with the probe sequence depending upon the
stringency of the hybridization conditions. The probes are preferably directly labeled, e.g.,
with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled such as with 
biotin to which a streptavidin complex may later bind. By assaying for the presence or 
absence of the probe, one can detect the presence or absence of the select sequence or 
subsequence. Diagnosis or prognosis may be based at the genomic level, or at the level of
RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant
cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or
organism, it will replicate non-recombinantly, e.g., using the in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein
made using recombinant techniques, e.g., through the expression of a recombinant nucleic
acid as depicted above.

The term "heterologous" when used with reference to portions of a nucleic acid 
indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

A "promoter" is defined as an array of nucleic acid control sequences that direct
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence,
e.g., wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under
stringent hybridization conditions when that sequence is present in a complex mixture (e.g.,
total cellular or library DNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under which a 
probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 
acids, but to no other sequences. Stringent conditions are sequence-dependent and will be
different in different circumstances. Longer sequences hybridize specifically at higher
temperatures. An extensive guide to the hybridization of nucleic acids is found in "Overview
of principles of hybridization and the strategy of nucleic acid assays" in Tijssen (1993) 
Hybridization with Nucleic Probes (Laboratory Techniques in Biochemistry and Molecular 
Biology) (vol. 24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength, pH, and nucleic
acid concentration) at which 50% of the probes complementary to the target hybridize to the
target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of
the probes are occupied at equilibrium). Stringent conditions will be those in which the salt
concentration is less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion
concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for
short probes (e.g., 10 to 50 nucleotides) and at least about 60° C for long probes (e.g., greater
than 50 nucleotides). Stringent conditions may also be achieved with the addition of 
destabilizing agents such as formamide. For selective or specific hybridization, a positive 
signal is typically at least two times background, preferably 10 times background 
hybridization. Exemplary stringent hybridization conditions can be as follows: 50%
formamide, 5x SSC, and 1% SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at
65° C, with wash in 0.2x SSC, and 0.1% SDS at 65° C. For PCR, a temperature of about 36°
C is typical for low stringency amplification, although annealing temperatures may vary
between about 32-48° C depending on primer length. For high stringency PCR amplification,
a temperature of about 62° C is typical, although high stringency annealing temperatures can
range from about 50° C to about 65° C, depending on the primer length and specificity.
Typical cycle conditions for both high and low stringency amplifications include a
denaturation phase of 90-95° C for 30-120 sec, an annealing phase lasting 30-120 sec, and an
extension phase of about 72° C for 1-2 min. Protocols and guidelines for low and high
stringency amplification reactions are available, e.g., in Innis, et al. (1990) PCR Protocols: A
Guide to Methods and Applications Academic Press, N.Y.
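
The numerical guidelines above can be collected into a few helper functions. This is a rough sketch only: it takes the Tm as a measured or separately estimated input rather than computing it, treats the "about" values as exact cutoffs, and uses hypothetical function names.

```python
def stringent_temperature_window(tm_celsius):
    """Temperatures about 5-10 degrees C below the Tm, as (low, high)."""
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def minimum_stringent_temperature(probe_length):
    """Lower temperature bound: about 30 C for short probes (10 to 50
    nucleotides), about 60 C for long probes (more than 50 nucleotides)."""
    return 60.0 if probe_length > 50 else 30.0


def is_positive_signal(signal, background, fold=2.0):
    """A positive signal is at least `fold` times background
    (two times typically, 10 times preferably)."""
    return signal >= fold * background


if __name__ == "__main__":
    print(stringent_temperature_window(70.0))
    print(minimum_stringent_temperature(25), minimum_stringent_temperature(200))
    print(is_positive_signal(250, 100), is_positive_signal(250, 100, fold=10.0))
```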

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under
moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37° C, and a wash in 1X SSC at 45° C. A positive hybridization is at least twice
background. Alternative hybridization and wash conditions can be utilized to provide 
conditions of similar stringency. Additional guidelines for determining hybridization
parameters are provided, e.g., in Ausubel, et al. (ed. 1991 and supplements) Current Protocols in
Molecular Biology Lippincott. 

The phrase "functional effects" in the context of assays for testing compounds that
modulate activity of an ovarian cancer protein includes the determination of a parameter that
is indirectly or directly under the influence of the ovarian cancer protein or nucleic acid, e.g., 
a functional, physical, physiological, or chemical effect, such as the ability to decrease 
ovarian cancer. It includes ligand binding activity; cell growth on soft agar; anchorage 
dependence; contact inhibition and density limitation of growth; cellular proliferation;
cellular transformation; growth factor or serum dependence; tumor specific marker levels;
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein
expression in cells undergoing metastasis, and other characteristics of ovarian cancer cells. 
"Functional effects" include in vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that
increases or decreases a parameter that is indirectly or directly under the influence of an
ovarian cancer protein sequence, e.g., functional, enzymatic, physical, physiological, and
chemical effects. Such functional effects can be measured by any means known to those
skilled in the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence,
absorbance, refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility
properties for the protein, measuring inducible markers or transcriptional activation of the 
ovarian cancer protein; measuring binding activity or binding assays, e.g., binding to 
antibodies or other ligands, and measuring cellular proliferation. Determination of the 
functional effect of a compound on ovarian cancer can also be performed using ovarian
cancer assays known to those of skill in the art such as in vitro assays, e.g., cell growth on
soft agar; anchorage dependence; contact inhibition and density limitation of growth; cellular 
proliferation; cellular transformation; growth factor or serum dependence; tumor specific 
marker levels; invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and 
protein expression in cells undergoing metastasis, and other characteristics of ovarian cancer
cells. The functional effects can be evaluated by means known to those skilled in the art, e.g.,
microscopy for quantitative or qualitative measures of alterations in morphological features,
measurement of changes in RNA or protein levels for ovarian cancer-associated sequences,
measurement of RNA stability, or identification of downstream or reporter gene expression
(CAT, luciferase, β-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence,
colorimetric reactions, antibody binding, inducible markers, and ligand binding assays.

"Inhibitors", "activators", and "modulators" of ovarian cancer polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 
compounds identified using in vitro and in vivo assays of ovarian cancer polynucleotide and
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the
activity or expression of ovarian cancer proteins, e.g., antagonists. Antisense or inhibitory
nucleic acids may inhibit expression and subsequent function of the protein. "Activators" are
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or
up regulate ovarian cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of ovarian cancer proteins, e.g., versions with altered activity, 
as well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 
chemical molecules, and the like. Assays for inhibitors and activators include, e.g.,
expressing the ovarian cancer protein in vitro, in cells, or cell membranes, applying putative
modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of ovarian cancer can also be identified by incubating 
ovarian cancer cells with the test compound and determining increases or decreases in the 
expression of one or more ovarian cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40,
50, or more ovarian cancer proteins, such as ovarian cancer proteins encoded by the
sequences set out in Tables 1-26. 

Samples or assays comprising ovarian cancer proteins that are treated with a potential
activator, inhibitor, or modulator are compared to control samples without the inhibitor,
activator, or modulator to examine the extent of inhibition. Control samples (untreated with
inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide
is achieved when the activity value relative to the control is about 80%, preferably 50%, more
preferably 25% or less. Activation of an ovarian cancer polypeptide is achieved when the
activity value relative to the control (untreated with activators) is 110%, more preferably
150%, more preferably 200-500% (e.g., 2-5 fold higher relative to the control), more
preferably 1000-3000% higher.
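
The comparison against an untreated control described above can be illustrated with a minimal Python sketch. The function names and the choice of the least stringent cutoffs stated above (about 80% for inhibition, 110% for activation) as defaults are assumptions made for illustration only; they are not part of the disclosed assays.

```python
def relative_activity(treated: float, control: float) -> float:
    """Express treated activity as a percentage of the untreated control (control = 100%)."""
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * treated / control


def classify_modulation(treated: float, control: float,
                        inhibit_cutoff: float = 80.0,
                        activate_cutoff: float = 110.0) -> str:
    """Label a test compound using hypothetical default cutoffs taken from the text."""
    pct = relative_activity(treated, control)
    if pct <= inhibit_cutoff:
        return "inhibitor"
    if pct >= activate_cutoff:
        return "activator"
    return "no effect"
```

Stricter embodiments (e.g., 50% or 25% for inhibition) could be expressed by passing a different `inhibit_cutoff`.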

The phrase "changes in cell growth" refers to a change in cell growth and 
proliferation characteristics in vitro or in vivo, e.g., cell viability, formation of foci,
anchorage independence, semi-solid or soft agar growth, change in contact inhibition or
density limitation of growth, loss of growth factor or serum requirements, change in cell
morphology, gain or loss of immortalization, gain or loss of tumor specific markers, ability to
form or suppress tumors when injected into suitable animal hosts, and/or immortalization of
the cell. See, e.g., pp. 231-241 in Freshney (1994) Culture of Animal Cells: A Manual of
Basic Technique (3d ed.) Wiley-Liss. 
"Tumor cell" refers to pre-cancerous, cancerous, and normal cells in a tumor.
"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is typically associated with phenotypic changes, such as immortalization of
cells, aberrant growth control, non-morphological changes, and/or malignancy. See
Freshney (1994) Culture of Animal Cells.

"Antibody" refers to a polypeptide comprising a framework region from an
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen.
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta,
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG,
IgM, IgA, IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See, e.g., 
Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each
tetramer is composed of two identical pairs of polypeptide chains, each pair having one
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each
chain defines a variable region of about 100 to 110 or more amino acids primarily responsible
for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH)
refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an
antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab
which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby
converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is essentially Fab
with part of the hinge region. See Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven.
While various antibody fragments are defined in terms of the digestion of an intact antibody,
one of skill will appreciate that such fragments may be synthesized de novo either chemically
or by using recombinant DNA methodology. Thus, the term antibody, as used herein, also 
includes antibody fragments either produced by the modification of whole antibodies, or 
those synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or 
those identified using phage display libraries. See, e.g., McCafferty, et al. (1990) Nature
348:552-554. 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many techniques known in the art can be used (see, e.g., Kohler and Milstein
(1975) Nature 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al., pp.
77-96 in Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer Therapy Liss; Coligan
(1991) Current Protocols in Immunology Lippincott; Harlow and Lane (1988) Antibodies: A
Laboratory Manual CSH Press; and Goding (1986) Monoclonal Antibodies: Principles and
Practice (2d ed.) Academic Press). Techniques for the production of single chain antibodies
(U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides of this
invention. Transgenic mice, or other organisms, e.g., other mammals, may be used to express
humanized antibodies. Alternatively, phage display technology can be used to identify 
antibodies and heteromeric Fab fragments that specifically bind to selected antigens. See, 
e.g., McCafferty, et al. (1990) Nature 348:552-554; and Marks, et al. (1992) Biotechnology 
10:779-783. 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a
portion thereof, is altered, replaced or exchanged so that the antigen binding site (variable
region) is linked to a constant region of a different or altered class, effector function and/or
species, or an entirely different molecule which confers new properties to the chimeric
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a
different or altered antigen specificity. 

Identification of ovarian cancer-associated sequences 

In one aspect, the expression levels of genes are determined in different patient
samples for which diagnosis information is desired, to provide expression profiles. An
expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have any particular gene similarly expressed, the evaluation of
a number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue (e.g., normal ovarian or other
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the ovary, or
ovarian cancer tissue or metastatic ovarian cancerous tissue can be compared with tissue
samples of ovarian and other tissues from surviving cancer patients. By comparing
expression profiles of tissue in known different ovarian cancer states, information regarding
which genes are important (including both up- and down-regulation of genes) in each of these 
states is obtained. Molecular profiling may distinguish subtypes of a currently collective 
disease designation, e.g., different forms of a cancer. 

The identification of sequences that are differentially expressed in ovarian cancer
versus non-ovarian cancer tissue allows the use of this information in a number of ways. For
example, a particular treatment regime may be evaluated: does a chemotherapeutic drug act
to down-regulate ovarian cancer, and thus tumor growth or recurrence, in a particular patient?
Alternatively, does existing treatment induce expression of a target? Similarly, diagnosis and
treatment outcomes may be done or confirmed by comparing patient samples with the known
expression profiles. Metastatic tissue can also be analyzed to determine the stage of ovarian
cancer in the tissue or origin of the primary tumor. Furthermore, these gene expression
profiles (or individual genes) allow screening of drug candidates with an eye to mimicking or
altering a particular expression profile; e.g., screening can be done for drugs that suppress the
ovarian cancer expression profile. This may be done by making biochips comprising sets of
the important ovarian cancer genes, which can then be used in these screens. These methods
can also be based on evaluating protein expression; that is, protein expression levels of the
ovarian cancer proteins can be evaluated for diagnostic purposes or to screen candidate
agents. In addition, the ovarian cancer nucleic acid sequences can be administered for gene
therapy purposes, including the administration of antisense or RNAi nucleic acids, or the
ovarian cancer proteins (including antibodies and other modulators thereof) administered as
therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that are 
differentially expressed in ovarian cancer relative to normal tissues and/or non-malignant 
tissues, herein termed "ovarian cancer sequences." As outlined below, ovarian cancer 
sequences include those that are up-regulated (e.g., expressed at a higher level) in ovarian
cancer, as well as those that are down-regulated (e.g., expressed at a lower level). In a
preferred embodiment, the ovarian cancer sequences are from humans; however, as will be
appreciated by those in the art, ovarian cancer sequences from other organisms may be useful
in animal models of disease and drug evaluation; thus, other ovarian cancer sequences are
provided, from vertebrates, including mammals, including rodents (rats, mice, hamsters,
guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, horses, etc.)
and pets (e.g., dogs, cats, etc.). Ovarian cancer sequences, e.g., counterpart genes, from other
organisms may be obtained using the techniques outlined below.

Ovarian cancer sequences can include both nucleic acid and amino acid sequences. 
Ovarian cancer nucleic acid sequences are useful in a variety of applications, including 
diagnostic applications, which will detect naturally occurring nucleic acids. Screening
applications, e.g., biochips comprising nucleic acid probes or PCR microtiter plates with
selected probes to the ovarian cancer sequences, are also provided.

An ovarian cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the ovarian cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization
conditions.

For identifying ovarian cancer-associated sequences, the ovarian cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal and cancerous 
tissues, or tumor tissue samples from patients who have metastatic disease vs. non metastatic 
tissue. Other suitable tissue comparisons include comparing ovarian cancer samples with
metastatic cancer samples from other cancers, such as lung, ovarian, gastrointestinal cancers,
etc. Samples of different stages of ovarian cancer, e.g., survivor tissue, drug resistant states,
and tissue undergoing metastasis, are applied to biochips comprising nucleic acid probes.
The samples are first microdissected, if applicable, and treated for the preparation of mRNA.
Suitable biochips are commercially available, e.g., from Affymetrix. Gene expression
profiles as described herein are generated and the data analyzed.

In one embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
ovarian, but also including, and not limited to, lung, heart, brain, liver, ovarian, kidney, 
muscle, colon, small intestine, large intestine, spleen, bone, and/or placenta. In a preferred
embodiment, those genes identified during the ovarian cancer screen that are expressed in any
significant amount in other tissues are removed from the profile, although in some
embodiments, expression in non-essential tissues may be tolerated. That is, when screening
for drugs, it is usually preferable that the target be disease specific, to minimize possible side
effects by interaction with target present in other organs.
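
The removal of candidates that are expressed in other normal tissues can be sketched as a simple filter. This is a minimal illustration under stated assumptions: the gene names, the expression units, the `max_level` cutoff standing in for "any significant amount," and the `tolerated` set standing in for non-essential tissues are all hypothetical.

```python
def filter_disease_specific(candidates, normal_expression,
                            max_level=50.0, tolerated=frozenset()):
    """Keep candidate genes whose expression stays at or below max_level
    in every normal tissue, ignoring tissues listed in `tolerated`.

    normal_expression maps gene -> {tissue: expression level}.
    """
    kept = []
    for gene in candidates:
        levels = normal_expression.get(gene, {})
        if all(level <= max_level
               for tissue, level in levels.items()
               if tissue not in tolerated):
            kept.append(gene)
    return kept


# Hypothetical example data (not from the disclosure).
normal = {
    "GENE_A": {"lung": 10.0, "heart": 5.0},
    "GENE_B": {"lung": 400.0, "heart": 8.0},
    "GENE_C": {"placenta": 300.0, "liver": 12.0},
}
specific = filter_disease_specific(["GENE_A", "GENE_B", "GENE_C"], normal)
```

Passing `tolerated={"placenta"}` would retain `GENE_C` as well, mirroring the embodiments in which expression in non-essential tissues is tolerated.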

In a preferred embodiment, ovarian cancer sequences are those that are up-regulated 
in ovarian cancer; that is, the expression of these genes is higher in the ovarian cancer tissue 
as compared to non-cancerous tissue. "Up-regulation" as used herein often means at least
about a two-fold change, preferably at least about a three-fold change, with at least about
five-fold or higher being preferred. Other embodiments are directed to sequences
up-regulated in non-malignant conditions relative to normal.

Unigene cluster identification numbers and accession numbers herein refer to the
GenBank sequence database and the sequences of the accession numbers are hereby
expressly incorporated by reference. GenBank is known in the art, see, e.g., Benson, et al.
(1998) Nucl. Acids Res. 26:1-7; and http://www.ncbi.nlm.nih.gov/. Sequences are also
available in other databases, e.g., European Molecular Biology Laboratory (EMBL) and
DNA Database of Japan (DDBJ). In some situations, the sequences may be derived from
assembly of available sequences or be predicted from genomic DNA using exon prediction
algorithms, e.g., FGENESH. See Salamov and Solovyev (2000) Genome Res. 10:516-522.
In other situations, sequences have been derived from cloning and sequencing of isolated 
nucleic acids. 

In another preferred embodiment, ovarian cancer sequences are those that are down- 
regulated in ovarian cancer; that is, the expression of these genes is lower in ovarian cancer 
tissue as compared to non-cancerous tissue. "Down-regulation" as used herein often means
at least about a two-fold change, preferably at least about a three-fold change, with at least 
about five-fold or higher being preferred. 
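
The fold-change definitions of up- and down-regulation given above can be expressed as a short Python sketch. The function names and the signed fold-change convention (positive when expression is higher in cancer tissue, negative when lower) are assumptions for illustration; the default `min_fold` of 2.0 reflects the least stringent threshold stated in the text.

```python
def fold_change(cancer: float, normal: float) -> float:
    """Signed fold change: positive if higher in cancer tissue, negative if lower."""
    if cancer <= 0 or normal <= 0:
        raise ValueError("expression values must be positive")
    ratio = cancer / normal
    return ratio if ratio >= 1 else -1.0 / ratio


def regulation(cancer: float, normal: float, min_fold: float = 2.0) -> str:
    """Classify a gene by comparing cancer and non-cancerous expression levels."""
    fc = fold_change(cancer, normal)
    if fc >= min_fold:
        return "up-regulated"
    if fc <= -min_fold:
        return "down-regulated"
    return "unchanged"
```

Setting `min_fold` to 3.0 or 5.0 would correspond to the more preferred thresholds.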

Informatics 

The ability to identify genes that are over or under expressed in ovarian cancer can
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure,
biosensor development, and other related areas. Expression profiles can be used in diagnostic
or prognostic evaluation of patients with ovarian cancer. Subcellular toxicological
information can be generated to better direct drug structure and activity correlation (see
Anderson (June 11-12, 1998) Pharmaceutical Proteomics: Targets, Mechanism, and Function,
paper presented at the IBC Proteomics conference, Coronado, CA) or in a biological sensor
device to predict the likely toxicological effect of chemical exposures and likely tolerable



WO 02/102235



PCT/US02/19297 



exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue from 
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides, 
lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that includes 
at least one set of assay data. The data contained in the database is acquired, e.g., using array
analysis either singly or in a library format. The database can be in a form in which data can
be maintained and transmitted, but is preferably an electronic database, and can be
maintained on any electronic device allowing for the storage of and access to the database,
such as a personal computer, but is preferably distributed on a wide area network, such as the
World Wide Web.

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent to those of skill in the art that similar databases 
can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or
absolute abundance of a variety of molecular and macromolecular species from a biological
sample undergoing ovarian cancer, e.g., the identification of ovarian cancer-associated
sequences described herein, provide an abundance of information which can be correlated
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring,
gene-disease causal linkages, identification of correlates of immunity and physiological
status, and outcome data, among others. Although data generated from the assays of the
invention is suited for manual review and analysis, in a preferred embodiment, data 
processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is known 
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database
system for storing biomolecular sequence information in a manner that allows sequences to
be catalogued and searched according to one or more protein function hierarchies. U.S. 
Patent 5,953,727 discloses a relational database having sequence records containing 
information in a format that allows a collection of partial-length DNA sequences to be 
catalogued and searched according to association with one or more sequencing projects for
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

Fundamentals of bioinformatics are provided, e.g., in Mount, et al. (2001)
Bioinformatics: Sequence and Genome Analysis CSH Press, NY; Durbin, et al. (eds. 1999)
Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
Cambridge Univ. Press; Baxevanis and Ouellette (eds. 1998) Bioinformatics: A Practical
Guide to the Analysis of Genes and Proteins (2d ed.) Wiley-Liss; Rashidi and Buehler (1999)
Bioinformatics: Basic Applications in Biological Science and Medicine CRC Press; Setubal,
et al. (eds. 1997) Introduction to Computational Molecular Biology Brooks/Cole; Misener and
Krawetz (eds. 2000) Bioinformatics: Methods and Protocols Humana Press; Higgins and
Taylor (eds. 2000) Bioinformatics: Sequence, Structure, and Databanks: A Practical
Approach Oxford Univ. Press; Brown (2001) Bioinformatics: A Biologist's Guide to
Biocomputing and the Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts
and Techniques Kaufmann Pub.; and Waterman (1995) Introduction to Computational
Biology: Maps, Sequences, and Genomes Chapman and Hall.

The present invention provides a computer database comprising a computer and
software for storing in computer-retrievable form assay data records cross-tabulated, e.g.,
with data specifying the source of the target-containing sample from which each sequence
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or
another tissue specimen to be analyzed for ovarian cancer. In another variation, assay records
cross-tabulate one or more of the following parameters for a target species in a sample: (1) a
unique identification code, which can include, e.g., a target molecular structure and/or
characteristic separation coordinate (e.g., electrophoretic or genomic position coordinates);
(2) sample source; and (3) absolute and/or relative quantity of target species present in the 
sample. 
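
The three cross-tabulated parameters listed above map naturally onto a small record type. The following Python sketch is illustrative only; the class name `AssayRecord`, the field names, and the nested-dictionary layout are assumptions and not a format specified by the invention.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    target_id: str       # (1) unique identification code
    sample_source: str   # (2) sample source
    quantity: float      # (3) absolute and/or relative quantity of the target


def cross_tabulate(records):
    """Return {target_id: {sample_source: quantity}} for a list of records."""
    table = defaultdict(dict)
    for rec in records:
        table[rec.target_id][rec.sample_source] = rec.quantity
    return {target: dict(by_source) for target, by_source in table.items()}


# Hypothetical records for two targets measured in control and pathological samples.
records = [
    AssayRecord("T1", "control tissue", 1.0),
    AssayRecord("T1", "neoplastic lesion", 6.5),
    AssayRecord("T2", "neoplastic lesion", 0.3),
]
table = cross_tabulate(records)
```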

The invention also provides for the storage and retrieval of a collection of target data
in a computer data storage apparatus, which can include magnetic disks, optical disks,
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic
bubble memory devices, and other data storage devices, including CPU registers and on-CPU
data storage arrays. Typically, the target data records are stored as a bit pattern in an array of
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor
and a charge storage area, which may be on the transistor). In one embodiment, the invention
provides such storage devices, and computer systems built therewith, comprising a bit pattern
encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a
method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in
or retrieved from a computer storage device or database and at least one other sequence. The
comparison can include a sequence analysis or comparison algorithm or computer program
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 
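
A computerized sequence comparison of the kind referred to above can be illustrated with a simple global alignment score in the style of Needleman and Wunsch. This sketch is not the implementation used by FASTA, TFASTA, GAP, or BESTFIT; the scoring values (match +1, mismatch -1, linear gap penalty -2) and the function name are assumptions chosen for illustration.

```python
def global_alignment_score(a: str, b: str,
                           match: int = 1, mismatch: int = -1, gap: int = -2) -> int:
    """Best global alignment score of a and b under a linear gap penalty."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diagonal,           # align a[i-1] with b[j-1]
                            prev[j] + gap,      # gap in b
                            curr[j - 1] + gap)) # gap in a
        prev = curr
    return prev[-1]
```

A stored sequence record could then be compared with another sequence by calling, e.g., `global_alignment_score("ACGT", "ACT")`.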

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux,
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed,
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention
in a file format suitable for retrieval and processing in a computerized sequence analysis,
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices
linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line, ISDN
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells)
composing a bit pattern encoding data acquired from an assay of the invention.

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the degree of
identity and gap weight to the target data. A central processor is preferably initialized to load
and execute the computer program for alignment and/or comparison of the assay results. 
Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 
file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or
SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
the query target and results are output via an I/O device. For example, a central processor
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC,
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM,
SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O
device.

The invention also preferably provides the use of a computer system, e.g., which 
typically comprises one or more of: (1) a computer; (2) a stored bit pattern encoding a
collection of peptide sequence specificity records obtained by methods of the inventions,
which may be stored in the computer; (3) a comparison target, such as a query target; and (4)
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 
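
The rank-ordering of database targets against a query target can be sketched as follows. As a stand-in for a dedicated alignment program, this example uses the similarity ratio from Python's standard `difflib.SequenceMatcher`; that substitution, the function name `rank_targets`, and the example sequences are assumptions for illustration rather than the comparison program of the invention.

```python
from difflib import SequenceMatcher


def rank_targets(query, database):
    """Return (name, similarity) pairs sorted from most to least similar.

    database maps a target name to its sequence string; similarity is a
    value in [0, 1] computed by difflib.SequenceMatcher.
    """
    scored = [
        (name, SequenceMatcher(None, query, seq, autojunk=False).ratio())
        for name, seq in database.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical peptide sequences.
db = {"exact": "MKTAYIAK", "close": "MKTAYIAR", "far": "GGGGGGGG"}
ranked = rank_targets("MKTAYIAK", db)
```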
Characteristics of ovarian cancer-associated proteins

Ovarian cancer proteins of the present invention may be categorized as secreted 
proteins, transmembrane proteins, or intracellular proteins. In one embodiment, the ovarian
cancer protein is an intracellular protein. Intracellular proteins may be found in the
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular
function and replication (including, e.g., signaling pathways); aberrant expression of such
proteins often results in unregulated or dysregulated cellular processes. See, e.g., Alberts, et
al. (eds. 1994) Molecular Biology of the Cell (3d ed.) Garland. For example, many
intracellular proteins have enzymatic activity such as protein kinase activity, protein
phosphatase activity, protease activity, nucleotide cyclase activity, polymerase activity, and
the like. Intracellular proteins can also serve as docking proteins that are involved in 
organizing complexes of proteins, or targeting proteins to various subcellular localizations, 
and are often involved in maintaining the structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the
proteins of one or more structural motifs for which defined functions have been attributed.
In addition to the highly conserved sequences found in the enzymatic domain of proteins, 
highly conserved sequences have been identified in proteins that are involved in protein- 
protein interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-
phosphorylated targets in a sequence dependent manner. PTB domains, which are distinct
from SH2 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-
rich targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name
only a few, have been shown to mediate protein-protein interactions. Some of these may also
be involved in binding to phospholipids or other second messengers. As will be appreciated
by one of ordinary skill in the art, these motifs can be identified on the basis of amino acid
sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 

Versions are available via the Internet from Washington University in St. Louis, the Sanger
Center in England, and the Karolinska Institute in Sweden. See, e.g., Bateman, et al. (2000)
Nuc. Acids Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al.
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320-322.

In another preferred embodiment, the ovarian cancer sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
They may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already described
for intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2 domain
containing proteins.

Transmembrane proteins may contain from one to many transmembrane domains.
For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases
and receptor serine/threonine protein kinases contain a single transmembrane domain.
However, various other proteins including channels and adenylyl cyclases contain numerous
transmembrane domains. Many important cell surface receptors such as G protein coupled
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain
7 membrane spanning regions. Characteristics of transmembrane domains include
approximately 17 consecutive hydrophobic amino acids that may be followed by charged
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the
localization and number of transmembrane domains within the protein may be predicted (see, 
e.g., PSORT web site http://psort.nibb.ac.jp/). Important transmembrane protein receptors 
include, but are not limited to the insulin receptor, insulin-like growth factor receptor, human 
growth hormone receptor, glucose transporters, transferrin receptor, epidermal growth factor
receptor, low density lipoprotein receptor, leptin receptor,
interleukin receptors, e.g., IL-1 receptor, IL-2 receptor, etc.
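
The sequence characteristic noted above, a run of approximately 17 consecutive hydrophobic amino acids, can be searched for with a short scan. This Python sketch is a deliberately crude illustration and not the method used by PSORT; the set of residues treated as hydrophobic and the function name are assumptions.

```python
# Assumed set of hydrophobic residues (one-letter codes); other definitions exist.
HYDROPHOBIC = set("AVLIMFWC")


def hydrophobic_runs(sequence, min_length=17):
    """Return (start, end) index pairs of hydrophobic runs at least min_length long.

    Indices are 0-based and end is exclusive, so sequence[start:end] is the run.
    """
    runs = []
    start = None
    for i, residue in enumerate(sequence.upper()):
        if residue in HYDROPHOBIC:
            if start is None:
                start = i
        else:
            if start is not None and i - start >= min_length:
                runs.append((start, i))
            start = None
    # Handle a run that extends to the end of the sequence.
    if start is not None and len(sequence) - start >= min_length:
        runs.append((start, len(sequence)))
    return runs


# Hypothetical sequence with one 20-residue hydrophobic stretch.
example = "MKR" + "L" * 20 + "KRDE" + "AVLIAVLI" + "K"
candidate_domains = hydrophobic_runs(example)
```

The number of runs returned gives a rough estimate of the number of candidate transmembrane domains; practical predictors use hydropathy scales and windowed averages instead of an exact run.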

The extracellular domains of transmembrane proteins are diverse; however, conserved 
motifs are found repeatedly among various extracellular domains. Conserved structure 
and/or functions have been ascribed to different extracellular motifs. Many extracellular
domains are involved in binding to other molecules. In one aspect, extracellular domains are
found on receptors. Factors that bind the receptor domain include circulating ligands, which 
may be peptides, proteins, or small molecules such as adenosine and the like. For example, 
growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines,
mitogenic factors, neurotrophic factors and the like. Extracellular domains also bind to
cell-associated molecules, or may be processed or shed to the blood stream. In this respect, they
can mediate cell-cell interactions. Cell-associated ligands can be tethered to the cell, e.g., via
a glycosylphosphatidylinositol (GPI) anchor, or may themselves be transmembrane proteins.
Extracellular domains also associate with the extracellular matrix and contribute to the 
maintenance of the cell structure. 

Ovarian cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are
described herein. In addition, as outlined below, transmembrane proteins can also be useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are
typically permeabilized to provide access to intracellular proteins. In addition, some
membrane proteins can be processed to release a soluble protein, or to expose a residual
fragment. Released soluble proteins may be useful diagnostic markers, and processed residual
protein fragments may be useful ovarian markers of disease.

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be
secreted through recombinant means by adding an appropriate signal sequence.

In another embodiment, the ovarian cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins may have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous physiological events; e.g., if circulating, they often serve
to transmit signals to various other cell types. The secreted protein may function in an
autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting on
cells in close proximity to the cell that secreted the factor), an endocrine manner (acting on
cells at a distance, e.g., secretion into the blood stream), or an exocrine manner (secretion, e.g.,
through a duct or to an adjacent epithelial surface, as in sweat glands, sebaceous glands,
pancreatic ducts, lacrimal glands, mammary glands, wax producing glands of the ear, etc.).
Thus, secreted molecules often find use in modulating or altering numerous aspects of
physiology. Ovarian cancer proteins that are secreted proteins are particularly preferred as
good diagnostic markers, e.g., for blood, plasma, serum, or stool tests. Those which are enzymes may be
antibody or small molecule therapeutic targets. Others may be useful as vaccine targets, e.g.,
via CTL mechanisms, as protein or DNA vaccines. 

Use of ovarian cancer nucleic acids 

As described above, an ovarian cancer sequence is initially identified by substantial
nucleic acid and/or amino acid sequence homology or linkage to the ovarian cancer
sequences outlined herein. Such homology can be based upon the overall nucleic acid or
amino acid sequence, and is generally determined as outlined below, using either homology
programs or hybridization conditions. Typically, linked sequences on an mRNA are found on
the same molecule.

The ovarian cancer nucleic acid sequences of the invention, e.g., in Tables 1-26, can be
fragments of larger genes, e.g., they are nucleic acid segments. "Genes" in this context 
includes coding regions, non-coding regions, and mixtures of coding and non-coding regions. 
Accordingly, as will be appreciated by those in the art, using the sequences provided herein,
extended sequences, in either direction, of the ovarian cancer genes can be obtained, using
techniques well known in the art for cloning either longer sequences or the full length 
sequences; see Ausubel, et al., supra. Much can be done by informatics and many sequences 
can be clustered to include multiple sequences corresponding to a single gene, e.g., systems 
such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the ovarian cancer nucleic acid is identified, it can be cloned and, if necessary,
its constituent parts recombined to form the entire ovarian cancer nucleic acid coding regions
or the entire mRNA sequence. Once isolated from its natural source, e.g., contained within a
plasmid or other vector or excised as a linear nucleic acid segment, the recombinant ovarian
cancer nucleic acid can be further used as a probe to identify and isolate other ovarian cancer
nucleic acids, e.g., extended coding regions. It can also be used as a "precursor" nucleic acid
to make modified or variant ovarian cancer nucleic acids and proteins.

The ovarian cancer nucleic acids of the present invention are useful in several ways. 
In a first embodiment, nucleic acid probes to the ovarian cancer nucleic acids are made and 
attached to biochips to be used in screening and diagnostic methods, as outlined below, or for
administration, e.g., for gene therapy, vaccine, RNAi, and/or antisense applications.
Alternatively, the ovarian cancer nucleic acids that include coding regions of ovarian cancer
proteins can be put into expression vectors for the expression of ovarian cancer proteins,
again for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to ovarian cancer nucleic acids (both
the nucleic acid sequences outlined in the figures and/or the complements thereof) are made.
The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the ovarian cancer nucleic acids, e.g., the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be any number of base
pair mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under normal reaction conditions, particularly high stringency
conditions, as outlined herein.
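
As a rough illustration of imperfect complementarity, the following sketch counts the positions at which a probe fails to base-pair with an equal-length target site. Whether a given number of mismatches still permits hybridization depends on the reaction conditions; the threshold max_mismatches and all function names here are assumptions for illustration and do not come from the text.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(sequence.upper()))


def count_mismatches(probe, target_site):
    """Count positions where the probe does not base-pair with the target
    site. Both sequences are written 5' to 3' and must be equal in length."""
    if len(probe) != len(target_site):
        raise ValueError("probe and target site must have the same length")
    perfect_probe = reverse_complement(target_site)
    return sum(1 for p, q in zip(probe.upper(), perfect_probe) if p != q)


def is_substantially_complementary(probe, target_site, max_mismatches=2):
    # max_mismatches is a hypothetical cutoff standing in for the
    # stringency of the hybridization conditions.
    return count_mismatches(probe, target_site) <= max_mismatches


target_site = "AAGCTTGG"
perfect_probe = reverse_complement(target_site)
```

A mismatch count is only a crude proxy; in practice hybridization also depends on probe length, base composition, and the position of the mismatches.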

A nucleic acid probe is generally single stranded but can be partially single and
partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally whole
genes are not used. In some embodiments, much longer nucleic acids can be used, up to
hundreds of bases.

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a
particular target. The probes can be overlapping (e.g., have some sequence in common), or
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
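
One way to picture redundant probes is to tile windows along a target sequence. In this sketch, a step smaller than the probe length gives overlapping probe sites and a larger step gives separate ones. The default probe length of 40 falls within the particularly preferred range stated above; the step of 30, the function name, and the toy target are assumptions for illustration.

```python
def tile_probe_sites(target, probe_length=40, step=30):
    """Return (start, site) pairs for windows of probe_length bases taken
    every step bases along the target. Sites overlap when the step is
    smaller than probe_length. A probe for each site would be designed to be
    substantially complementary to that site."""
    if probe_length <= 0 or step <= 0:
        raise ValueError("probe_length and step must be positive")
    last_start = len(target) - probe_length
    return [
        (start, target[start:start + probe_length])
        for start in range(0, last_start + 1, step)
    ]


# Hypothetical 100-base target sequence.
toy_target = "ACGT" * 25
sites = tile_probe_sites(toy_target)
```

With these defaults the 100-base toy target yields three overlapping sites, matching the preference for three probes per target.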

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant that the association or binding between the nucleic acid probe and the
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and
removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is meant one or more of electrostatic,
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent
attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical
equivalents herein is meant that the two moieties, the solid support and the probe, are
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds.
Covalent bonds can be formed directly between the probe and the solid support or can be
formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as will be
appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or
other grammatical equivalents herein is meant a material that can be modified to contain
discrete individual sites appropriate for the attachment or association of the nucleic acid
probes and is amenable to at least one detection method. As will be appreciated by those in
the art, the number of possible substrates is very large; they include, but are not limited to,
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene,
polyurethanes, Teflon™, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or
silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses,
plastics, etc. In general, the substrates allow optical detection and do not appreciably
fluoresce. See, e.g., WO0055627 Reusable Low Fluorescent Plastic Biochip.

Generally the substrate is planar, although as will be appreciated by those in the art,
other configurations of substrates may be used as well. For example, the probes may be
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed
cell foams made of particular plastics.

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxyl groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized as is known in the art, and then 
attached to the surface of the solid support. As will be appreciated by those skilled in the art, 
either the 5' or 3' terminus may be attached to the solid support, or attachment may be via an
internal nucleoside.

In another embodiment, the immobilization to the solid support may be very strong, 
yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to
surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is known in
the art. For example, photoactivation techniques utilizing photopolymerization compounds
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in
situ, using well known photolithographic techniques, such as those described in WO
95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited
within, all of which are expressly incorporated by reference; these methods of attachment
form the basis of the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression level of 
ovarian cancer-associated sequences. These assays are typically performed in conjunction
with reverse transcription. In such assays, an ovarian cancer-associated nucleic acid
sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or
PCR). In a quantitative amplification, the amount of amplification product will be
proportional to the amount of template in the original sample. Comparison to appropriate
controls provides a measure of the amount of ovarian cancer-associated RNA. Methods of
quantitative amplification are well known to those of skill in the art. Detailed protocols for
quantitative PCR are available. See, e.g., Innis, et al. (1990) PCR Protocols: A Guide to
Methods and Applications Academic Press.
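
The comparison to controls can be sketched with the widely used comparative threshold cycle calculation. This method is not named in the text and is offered only as one possible illustration; the threshold cycle (Ct) inputs, the reference gene normalization, the assumed amplification efficiency, and the function name are all assumptions.

```python
def relative_expression(ct_target_sample, ct_reference_sample,
                        ct_target_control, ct_reference_control,
                        efficiency=1.0):
    """Estimate the expression of a target in a sample relative to a control.

    Each Ct is the cycle at which the amplification signal crosses a fixed
    threshold. An efficiency of 1.0 assumes the product doubles each cycle.
    """
    # Normalize the target to a reference gene within each specimen.
    delta_sample = ct_target_sample - ct_reference_sample
    delta_control = ct_target_control - ct_reference_control
    # Compare the normalized values between sample and control.
    delta_delta_ct = delta_sample - delta_control
    return (1.0 + efficiency) ** (-delta_delta_ct)


# Hypothetical Ct values: the target crosses threshold two cycles earlier
# in the sample than in the control, after normalization.
fold_change = relative_expression(20.0, 15.0, 22.0, 15.0)
```

A fold change above 1 suggests higher expression in the sample than in the control. Real assays would also verify that the amplification efficiencies of target and reference are comparable.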

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com).

Other suitable amplification methods include, but are not limited to, ligase chain
reaction (LCR; see Wu and Wallace (1989) Genomics 4:560-569; Landegren, et al. (1988)
Science 241:1077-1080; and Barringer, et al. (1990) Gene 89:117-122), transcription
amplification (Kwoh, et al. (1989) Proc. Nat'l Acad. Sci. USA 86:1173-1177), self-sustained
sequence replication (Guatelli, et al. (1990) Proc. Nat'l Acad. Sci. USA 87:1874-1878), dot
PCR, linker adapter PCR, etc.

Expression of ovarian cancer proteins from nucleic acids 

In a preferred embodiment, ovarian cancer nucleic acids, e.g., encoding ovarian
cancer proteins, are used to make a variety of expression vectors to express ovarian cancer
proteins which can then be used in screening assays, as described below. Expression vectors
and recombinant DNA technology are well known and are used to express proteins. See, e.g.,
Ausubel, supra; and Fernandez and Hoeffler (eds. 1999) Gene Expression Systems Academic
Press. The expression vectors may be either self-replicating extrachromosomal vectors or
vectors which integrate into a host genome. Generally, these expression vectors include
transcriptional and translational regulatory nucleic acid operably linked to the nucleic acid
encoding the ovarian cancer protein. The term "control sequences" refers to DNA sequences
used for the expression of an operably linked coding sequence in a particular host organism.
Control sequences that are suitable for prokaryotes, e.g., include a promoter, optionally an
operator sequence, and a ribosome binding site. Eukaryotic cells are known to utilize
promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For example, DNA for a pre-sequence or secretory leader is 
operably linked to DNA for a polypeptide if it is expressed as a pre-protein that participates
in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding
sequence if it affects the transcription of the sequence; a ribosome binding site is operably
linked to a coding sequence if it is positioned so as to facilitate translation; and two sequences
may be operably linked when they are physically part of the same polymer. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have
to be contiguous. Linking is typically accomplished by ligation at convenient restriction
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in
accordance with conventional practice. Transcriptional and translational regulatory nucleic
acid will generally be appropriate to the host cell used to express the ovarian cancer protein.
Numerous types of appropriate expression vectors, and suitable regulatory sequences are
known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include, but are
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences typically encode constitutive or inducible promoters. The
promoters may be naturally occurring promoters or hybrid promoters. Hybrid promoters,
which combine elements of more than one promoter, are also known in the art, and are useful 
in the present invention. 

In addition, an expression vector may comprise additional elements. For example, the
expression vector may have two replication systems, thus allowing it to be maintained in two
organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host for
cloning and amplification. Furthermore, for integrating expression vectors, the expression
vector contains at least one sequence homologous to the host cell genome, and preferably two
homologous sequences which flank the expression construct. The integrating vector may be
directed to a specific locus in the host cell by selecting the appropriate homologous sequence
for inclusion in the vector. Constructs for integrating vectors are available. See, e.g.,
Fernandez and Hoeffler, supra. 

In addition, in a preferred embodiment, the expression vector contains a selectable 
marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The ovarian cancer proteins of the present invention are produced by culturing a host
cell transformed with an expression vector containing nucleic acid encoding an ovarian
cancer protein, under the appropriate conditions to induce or cause expression of the ovarian
cancer protein. Conditions appropriate for ovarian cancer protein expression will vary with
the choice of the expression vector and the host cell, and will be easily ascertained by one
skilled in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculovirus systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK,
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a
macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the ovarian cancer proteins are expressed in mammalian 
cells. Mammalian expression systems are also known in the art, and include retroviral and
adenoviral systems. One expression vector system is a retroviral vector system such as is
generally described in PCT/US97/01019 and PCT/US97/01048, both of which are hereby
expressly incorporated by reference. Of particular use as mammalian promoters are the
promoters from mammalian viral genes, since the viral genes are often highly expressed and
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the
CMV promoter. See, e.g., Fernandez and Hoeffler, supra. Typically, transcription
termination and polyadenylation sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus, together with the promoter elements,
flank the coding sequence. Examples of transcription terminator and polyadenylation signals
include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as well as
other hosts, are well known in the art, and will vary with the host cell used. Techniques
include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei.

In a preferred embodiment, ovarian cancer proteins are expressed in bacterial systems.
Bacterial expression systems are well known in the art. Promoters from bacteriophage may
also be used and are known in the art. In addition, synthetic promoters and hybrid promoters
are also useful; e.g., the tac promoter is a hybrid of the trp and lac promoter sequences.
Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial
origin that have the ability to bind bacterial RNA polymerase and initiate transcription. In
addition to a functioning promoter sequence, an efficient ribosome binding site is desirable.
The expression vector may also include a signal peptide sequence that provides for secretion
of the ovarian cancer protein in bacteria. The protein is either secreted into the growth media
(gram-positive bacteria) or into the periplasmic space, located between the inner and outer
membrane of the cell (gram-negative bacteria). The bacterial expression vector may also
include a selectable marker gene to allow for the selection of bacterial strains that have been
transformed. Suitable selection genes include genes which render the bacteria resistant to
drugs such as ampicillin, chloramphenicol, erythromycin, kanamycin, neomycin, and
tetracycline. Selectable markers also include biosynthetic genes, such as those in the
histidine, tryptophan, and leucine biosynthetic pathways. These components are assembled
into expression vectors. Expression vectors for bacteria are well known in the art, and
include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, and Streptomyces
lividans, among others. See Fernandez and Hoeffler, supra. The bacterial expression vectors
are transformed into bacterial host cells using techniques well known in the art, such as 
calcium chloride treatment, electroporation, and others. 

In one embodiment, ovarian cancer proteins are produced in insect cells. Expression
vectors for the transformation of insect cells, and in particular, baculovirus-based expression
vectors, are well known in the art.

In a preferred embodiment, an ovarian cancer protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The ovarian cancer protein may also be made as a fusion protein, using techniques 
well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired 
epitope is small, the ovarian cancer protein may be fused to a carrier protein to form an
immunogen. Alternatively, the ovarian cancer protein may be made as a fusion protein to
increase expression, or for other reasons. For example, when the ovarian cancer protein is an
ovarian cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression purposes.

In a preferred embodiment, the ovarian cancer protein is purified or isolated after
expression. Ovarian cancer proteins may be isolated or purified in a variety of ways known
to those skilled in the art depending on what other components are present in the sample.
Standard purification methods include electrophoretic, molecular, immunological and
chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the ovarian cancer
protein may be purified using a standard anti-ovarian cancer protein antibody column.
Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, are also
useful. For general guidance in suitable purification techniques, see Scopes (1982) Protein
Purification Springer-Verlag. The degree of purification necessary will vary depending on
the use of the ovarian cancer protein. In some instances no purification will be necessary.

Once expressed and purified if necessary, the ovarian cancer proteins and nucleic 
acids are useful in a number of applications. They may be used as immunoselection reagents,
as vaccine reagents, as screening agents, etc. 

Variants of ovarian cancer proteins 

In one embodiment, the ovarian cancer proteins are derivative or variant ovarian
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below,
the derivative ovarian cancer peptide will often contain at least one amino acid substitution,
deletion or insertion, with amino acid substitutions being particularly preferred. The amino
acid substitution, insertion, or deletion may occur at most any residue within the ovarian 
cancer peptide. 

Also included within one embodiment of ovarian cancer proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of
three classes: substitutional, insertional or deletional variants. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the ovarian cancer
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell
culture as outlined above. However, variant ovarian cancer protein fragments having up to
about 100-150 residues may be prepared by in vitro synthesis using established techniques.
Amino acid sequence variants are characterized by the predetermined nature of the variation,
a feature that sets them apart from naturally occurring allelic or interspecies variation of the
ovarian cancer protein amino acid sequence. The variants typically exhibit the same
qualitative biological activity as the naturally occurring analogue, although variants can also
be selected which have modified characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to
optimize the performance of a mutation at a given site, random mutagenesis may be
conducted at the target codon or region and the expressed ovarian cancer variants screened
for the optimal combination of desired activity. Techniques for making substitution
mutations at predetermined sites in DNA having a known sequence are well known, e.g.,
M13 primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using
assays of ovarian cancer protein activities.
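
The idea of a predetermined site with an undetermined mutation can be sketched by enumerating every alternative codon at one chosen position of a coding sequence. This is an in silico illustration only, not a laboratory protocol; the function name and the toy coding sequence are assumptions.

```python
from itertools import product


def codon_variants(coding_sequence, codon_index):
    """Return every variant of an uppercase coding sequence in which the
    codon at codon_index (0-based) is replaced by a different codon. All
    other positions are left unchanged."""
    start = 3 * codon_index
    original = coding_sequence[start:start + 3]
    if len(original) != 3:
        raise ValueError("codon_index is outside the coding sequence")
    variants = []
    for bases in product("ACGT", repeat=3):
        codon = "".join(bases)
        if codon != original:
            # Note: this includes stop codons, which a real screen
            # might choose to exclude.
            variants.append(
                coding_sequence[:start] + codon + coding_sequence[start + 3:]
            )
    return variants


# Hypothetical three-codon sequence; the middle codon is the target site.
toy_cds = "ATGGCCTAA"
library = codon_variants(toy_cds, 1)
```

Each of the 63 variants would then be expressed and screened for the desired activity, as described above.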

Amino acid substitutions are typically of single residues; insertions usually will be on
the order of from about 1 to 20 amino acids, although considerably larger insertions may be
tolerated. Deletions range from about 1 to about 20 residues, although in some cases
deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to arrive
at a final derivative. Generally these changes are done on a few amino acids to minimize the
alteration of the molecule. However, larger changes may be tolerated in certain
circumstances. When small alterations in the characteristics of the ovarian cancer protein are
desired, substitutions are generally made in accordance with the amino acid substitution
relationships provided in the definition section.

The variants typically exhibit the same qualitative biological activity and will elicit
the same immune response as the naturally-occurring analog, although variants also are
selected to modify the characteristics of the ovarian cancer proteins as needed. Alternatively,
the variant may be designed such that the biological activity of the ovarian cancer protein is
altered. For example, glycosylation sites may be altered or removed.

Substantial changes in function or immimological identity are made by selecting 
substitutions that are less conservative than those described above. For example, 
substitutions maybe made which more significantly affect: the structure of the polypeptide 
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure; 

30 the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain. 

The substitutions which in general are expected to produce the greatest changes in the
polypeptide's properties are those in which (a) a hydrophilic residue, e.g., serine or threonine,
is substituted for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine,
valine, or alanine; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a
residue having an electropositive side chain, e.g., lysine, arginine, or histidine, is substituted
for (or by) an electronegative residue, e.g., glutamic or aspartic acid; (d) a residue having a
bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain,
e.g., glycine; or (e) a proline residue is incorporated or substituted, which changes the degree
of rotational freedom of the peptidyl bond.
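
By way of illustration only, categories (a) through (e) above can be expressed as a small lookup. The following Python sketch is a hypothetical helper, not part of the specification: the residue sets contain only the examples named in the text (one-letter amino acid codes), the function name is invented, and category (e) is assumed to apply whenever proline is either introduced or replaced.

```python
# Residue groups use only the examples given in the text (one-letter codes).
# All names below are hypothetical and serve purely as an illustration.
HYDROPHILIC = {"S", "T"}                  # serine, threonine
HYDROPHOBIC = {"L", "I", "F", "V", "A"}   # leucine, isoleucine, phenylalanine, valine, alanine
ELECTROPOSITIVE = {"K", "R", "H"}         # lysine, arginine, histidine
ELECTRONEGATIVE = {"E", "D"}              # glutamic acid, aspartic acid
BULKY = {"F"}                             # phenylalanine
NO_SIDE_CHAIN = {"G"}                     # glycine


def _between(a, b, group1, group2):
    """True if one residue is in group1 and the other in group2 ("for (or by)")."""
    return (a in group1 and b in group2) or (a in group2 and b in group1)


def disruptive_categories(original, replacement):
    """Return the labels 'a'-'e' that a single substitution falls under."""
    categories = []
    if original == replacement:
        return categories
    if _between(original, replacement, HYDROPHILIC, HYDROPHOBIC):
        categories.append("a")
    if original in {"C", "P"} or replacement in {"C", "P"}:
        categories.append("b")
    if _between(original, replacement, ELECTROPOSITIVE, ELECTRONEGATIVE):
        categories.append("c")
    if _between(original, replacement, BULKY, NO_SIDE_CHAIN):
        categories.append("d")
    if original == "P" or replacement == "P":  # assumed reading of category (e)
        categories.append("e")
    return categories


if __name__ == "__main__":
    for pair in (("S", "L"), ("K", "E"), ("F", "G"), ("A", "P"), ("L", "I")):
        print(pair, disruptive_categories(*pair))
```

A substitution that returns an empty list, such as leucine to isoleucine, matches none of the listed categories; this does not by itself establish that the change is conservative.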

Covalent modifications of ovarian cancer polypeptides are included within the scope
of this invention. One type of covalent modification includes reacting targeted amino acid
residues of an ovarian cancer polypeptide with an organic derivatizing agent that is capable of
reacting with selected side chains or the N- or C-terminal residues of an ovarian cancer
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking
ovarian cancer polypeptides to a water-insoluble support matrix or surface for use in the
method for purifying anti-ovarian cancer polypeptide antibodies or screening assays, as is
more fully described below. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutamine and asparagine residues to the
corresponding glutamic and aspartic acid residues, respectively, hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of serine, threonine, or tyrosine residues,
methylation of the amino groups of the lysine, arginine, and histidine side chains (e.g., pp.
79-86, Creighton (1983) Proteins: Structure and Molecular Properties Freeman), acetylation
of the N-terminal amine, and amidation of a C-terminal carboxyl group.

Another type of covalent modification of the ovarian cancer polypeptide included
within the scope of this invention comprises altering the native glycosylation pattern of the
polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to
mean deleting one or more carbohydrate moieties found in native sequence ovarian cancer
polypeptide, and/or adding one or more glycosylation sites that are not present in the native
sequence ovarian cancer polypeptide. Glycosylation patterns can be altered in many ways.
For example, the use of different cell types to express ovarian cancer-associated sequences
can result in different glycosylation patterns.

Addition of glycosylation sites to ovarian cancer polypeptides may also be
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g.,
by the addition of, or substitution by, one or more serine or threonine residues to the native
sequence ovarian cancer polypeptide (for O-linked glycosylation sites). The ovarian cancer
amino acid sequence may optionally be altered through changes at the DNA level,
particularly by mutating the DNA encoding the ovarian cancer polypeptide at pre-selected
bases such that codons are generated that will translate into the desired amino acids.

Another means of increasing the number of carbohydrate moieties on the ovarian
cancer polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide.
See, e.g., WO 87/05330, and pp. 259-306 in Aplin and Wriston (1981) CRC Crit. Rev.
Biochem. CRC Press.

Removal of carbohydrate moieties present on the ovarian cancer polypeptide may be
accomplished chemically or enzymatically or by mutational substitution of codons encoding
amino acid residues that serve as targets for glycosylation. Chemical deglycosylation
techniques are applicable. See, e.g., Sojar and Bahl (1987) Arch. Biochem. Biophys. 259:52-57;
and Edge, et al. (1981) Anal. Biochem. 118:131-137. Enzymatic cleavage of
carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and
exo-glycosidases. See, e.g., Thotakura, et al. (1987) Meth. Enzymol. 138:350-359.

Another type of covalent modification of ovarian cancer polypeptides comprises linking the
ovarian cancer polypeptide to one of a variety of non-proteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol, or polyoxyalkylene. See, e.g., U.S. Patent Nos.
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192; or 4,179,337.

Ovarian cancer polypeptides of the present invention may also be modified in a way
to form chimeric molecules, e.g., comprising an ovarian cancer polypeptide fused to another
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of an ovarian cancer polypeptide with a tag polypeptide which
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is
generally placed at the amino- or carboxyl-terminus of the ovarian cancer polypeptide. The
presence of such epitope-tagged forms of an ovarian cancer polypeptide can be detected
using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the
ovarian cancer polypeptide to be readily purified by affinity purification using an anti-tag
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative
embodiment, the chimeric molecule may comprise a fusion of an ovarian cancer polypeptide
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the art.
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; His6
and metal chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field, et al.
(1988) Mol. Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and
9E10 antibodies thereto (Evan, et al. (1985) Mol. Cell. Biol. 5:3610-3616); and the Herpes
Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein
Engineering 3:547-553). Other tag polypeptides include, e.g., the Flag-peptide (Hopp, et al.
(1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science
255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163-15166);
and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Nat'l
Acad. Sci. USA 87:6393-6397).

Also included are other ovarian cancer proteins of the ovarian cancer family, and
ovarian cancer proteins from other organisms, which are cloned and expressed as outlined
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be
used to find other related ovarian cancer proteins from humans or other organisms. As will
be appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include the unique areas of the ovarian cancer nucleic acid sequence. As is generally known
in the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with
from about 20 to about 30 being preferred, and may contain inosine as needed. The
conditions for the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra).
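
The primer length ranges stated above lend themselves to a simple check. The following Python sketch is illustrative only: the function name and return labels are hypothetical, the "about" in the text is treated as a hard boundary for simplicity, and inosine is represented by the letter I as an assumed convention.

```python
# Hypothetical helper: classifies a candidate primer by the length ranges in the text.
# "About 15 to about 35" and "about 20 to about 30" are treated as exact bounds here,
# which is a simplifying assumption.
ALLOWED_BASES = set("ACGTI")  # I = inosine (assumed one-letter representation)


def primer_length_class(primer):
    """Return 'preferred', 'acceptable', or 'outside range' for a primer sequence."""
    seq = primer.upper()
    if not seq or set(seq) - ALLOWED_BASES:
        raise ValueError("primer must be non-empty and contain only A, C, G, T or I")
    length = len(seq)
    if 20 <= length <= 30:
        return "preferred"
    if 15 <= length <= 35:
        return "acceptable"
    return "outside range"


if __name__ == "__main__":
    print(primer_length_class("ACGT" * 5))    # 20 nucleotides
    print(primer_length_class("ACGTI" * 3))   # 15 nucleotides, contains inosine
    print(primer_length_class("ACGT" * 10))   # 40 nucleotides
```

Length is only one of several design considerations; the sketch does not assess uniqueness of the target region, melting temperature, or degeneracy.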

Antibodies to ovarian cancer proteins 

In a preferred embodiment, when the ovarian cancer protein is to be used to generate
antibodies, e.g., for immunotherapy or immunodiagnosis, the ovarian cancer protein should
share at least one epitope or determinant with the full length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller ovarian cancer protein will be able to bind to the full-length protein,
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are known to the skilled artisan (e.g.,
Coligan, supra; and Harlow and Lane, supra). Polyclonal antibodies can be raised in a
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant.
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in
the mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art
without undue experimentation.

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies
may be prepared using hybridoma methods, such as those described by Kohler and Milstein
(1975) Nature 256:495-497. In a hybridoma method, a mouse, hamster, or other appropriate
host animal is typically immunized with an immunizing agent to elicit lymphocytes that
produce or are capable of producing antibodies that will specifically bind to the immunizing
agent. Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent
will typically include a polypeptide encoded by a nucleic acid of Tables 1-26 or fragment
thereof, or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs")
are used if cells of human origin are desired, or spleen cells or lymph node cells are used if
non-human mammalian sources are desired. The lymphocytes are then fused with an
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell (e.g., pp. 59-103 in Goding (1986) Monoclonal Antibodies: Principles and
Practice Academic Press). Immortalized cell lines are usually transformed mammalian cells,
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture
medium that preferably contains one or more substances that inhibit the growth or survival of
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT
medium"), which substances prevent the growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are
monoclonal, preferably human or humanized, antibodies that have binding specificities for at
least two different antigens or that have binding specificities for two epitopes on the same
antigen. In one embodiment, one of the binding specificities is for a protein encoded by a
nucleic acid of Tables 1-26 or a fragment thereof; the other one is for any other antigen, and
preferably for a cell-surface protein or receptor or receptor subunit, preferably one that is
tumor specific. Alternatively, tetramer-type technology may create multivalent reagents.

In a preferred embodiment, the antibodies to ovarian cancer protein are capable of
reducing or eliminating a biological function of an ovarian cancer protein, as is described
below. That is, the addition of anti-ovarian cancer protein antibodies (either polyclonal or
preferably monoclonal) to ovarian cancer tissue (or cells containing ovarian cancer) may
reduce or eliminate the ovarian cancer. Generally, at least a 25% decrease in activity,
growth, size or the like is preferred, with at least about 50% being particularly preferred and
about a 95-100% decrease being especially preferred.

In a preferred embodiment the antibodies to the ovarian cancer proteins are
humanized antibodies (e.g., Xenerex Biosciences; Medarex, Inc.; Abgenix, Inc.; Protein
Design Labs, Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv,
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include
human immunoglobulins (recipient antibody) in which residues from a complementarity
determining region (CDR) of the recipient are replaced by residues from a CDR of a
non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity,
affinity and capacity. In some instances, Fv framework residues of the human
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies
may also comprise residues which are found neither in the recipient antibody nor in the
imported CDR or framework sequences. In general, a humanized antibody will comprise
substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin
and all or substantially all of the framework (FR) regions are those of a human
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise
at least a portion of an immunoglobulin constant region (Fc), typically that of a human
immunoglobulin. Humanization can be essentially performed following the method of
Winter and co-workers, e.g., by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. See, e.g., Jones, et al. (1986) Nature 321:522-525;
Riechmann, et al. (1988) Nature 332:323-329; Presta (1992) Curr. Op. Struct. Biol.
2:593-596; and Verhoeyen, et al. (1988) Science 239:1534-1536. Accordingly, such
humanized antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein
substantially less than an intact human variable domain has been substituted by the
corresponding sequence from a non-human species.

Human antibodies can also be produced using various techniques known in the art,
including phage display libraries (see, e.g., Hoogenboom and Winter (1991) J. Mol. Biol.
227:381-388; and Marks, et al. (1991) J. Mol. Biol. 222:581-597) or human monoclonal
antibodies (see, e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal Antibodies
and Cancer Therapy Liss; and Boerner, et al. (1991) J. Immunol. 147:86-95). Similarly,
human antibodies can be made by introducing human immunoglobulin loci into transgenic
animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or
completely inactivated. Upon challenge, human antibody production is observed, which
closely resembles that seen in humans in all respects, including gene rearrangement,
assembly, and antibody repertoire. See, e.g., U.S. Patent Nos. 5,545,807; 5,545,806;
5,569,825; 5,625,126; 5,633,425; 5,661,016; Marks, et al. (1992) Bio/Technology 10:779-783;
Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) Nature 368:812-13;
Neuberger (1996) Nature Biotechnology 14:826 commenting on Fishwild, et al. (1996)
Nature Biotechnology 14:845-51; and Lonberg and Huszar (1995) Intern. Rev. Immunol.
13:65-93.

By immunotherapy is meant treatment of ovarian cancer, e.g., with an antibody raised
against ovarian cancer proteins. As used herein, immunotherapy can be passive or active.
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient
(patient). Active immunization is the induction of antibody and/or T-cell responses in a
recipient (patient). Induction of an immune response is the result of providing the recipient
with an antigen to which antibodies are raised. The antigen may be provided by injecting a
polypeptide against which antibodies are desired to be raised into a recipient, or contacting
the recipient with a nucleic acid capable of expressing the antigen and under conditions for
expression of the antigen, leading to an immune response.

In a preferred embodiment the ovarian cancer proteins against which antibodies are
raised are secreted proteins as described above. Without being bound by theory, antibodies
used for treatment bind the secreted protein and prevent it from binding to its receptor, thereby
inactivating the secreted ovarian cancer protein.

In another preferred embodiment, the ovarian cancer protein to which antibodies are
raised is a transmembrane protein. Without being bound by theory, antibodies used for
treatment bind the extracellular domain of the ovarian cancer protein and prevent it from
binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane ovarian cancer protein. As will
be appreciated by one of ordinary skill in the art, the antibody may be a competitive,
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the
ovarian cancer protein. The antibody is also an antagonist of the ovarian cancer protein.
Further, the antibody prevents activation of the transmembrane ovarian cancer protein. In
one aspect, when the antibody prevents the binding of other molecules to the ovarian cancer
protein, the antibody prevents growth of the cell. The antibody may also be used to target or
sensitize the cell to cytotoxic agents, including, but not limited to, TNF-α, TNF-β, IL-1,
INF-γ, and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin,
methotrexate, and the like. In some instances the antibody belongs to a sub-type that
activates serum complement when complexed with the transmembrane protein, thereby
mediating cytotoxicity or antibody-dependent cell-mediated cytotoxicity (ADCC). Thus, ovarian
cancer is treated by administering to a patient antibodies directed against the transmembrane
ovarian cancer protein. Antibody-labeling may activate a cytotoxin, localize a toxin payload, or
otherwise provide means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector moiety.
The effector moiety can be any number of molecules, including labeling moieties such as
radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the
therapeutic moiety is a small molecule that modulates the activity of the ovarian cancer
protein. In another aspect the therapeutic moiety modulates the activity of molecules
associated with or in close proximity to the ovarian cancer protein. The therapeutic moiety
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity
associated with ovarian cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In
this method, targeting the cytotoxic agent to ovarian cancer tissue or cells results in a
reduction in the number of afflicted cells, thereby reducing symptoms associated with ovarian
cancer. Cytotoxic agents are numerous and varied and include, but are not limited to,
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against ovarian cancer
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to
the antibody. Targeting the therapeutic moiety to transmembrane ovarian cancer proteins not
only serves to increase the local concentration of therapeutic moiety in the ovarian cancer
afflicted area, but also serves to reduce deleterious side effects that may be associated with
the untargeted therapeutic moiety.

In another preferred embodiment, the ovarian cancer protein against which the
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, where the ovarian cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a
nuclear localization signal.

The ovarian cancer antibodies of the invention specifically bind to ovarian cancer
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a
Kd of at least about 0.1 mM, more usually at least about 1 µM, preferably at least about 0.1
µM or better, and most preferably, 0.01 µM or better. Selectivity of binding is also
important.

Detection of ovarian cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the ovarian cancer phenotype. Expression levels of genes in normal tissue
(e.g., not undergoing ovarian cancer) and in ovarian cancer tissue (and in some cases, for
varying severities of ovarian cancer that relate to prognosis, as outlined below, or in
non-malignant disease) are evaluated to provide expression profiles. An expression profile of a
particular cell state or point of development is essentially a "fingerprint" of the state of the
cell. While two states may have any particular gene similarly expressed, the evaluation of a
number of genes simultaneously allows the generation of a gene expression profile that is
reflective of the state of the cell. By comparing expression profiles of cells in different states,
information regarding which genes are important (including both up- and down-regulation of
genes) in each of these states is obtained. Then, diagnosis may be performed or confirmed to
determine whether a tissue sample has the gene expression profile of normal or cancerous
tissue. This will provide for molecular diagnosis of related conditions.

"Differential expression," or grammatical equivalents as used herein, refers to
qualitative or quantitative differences in the temporal and/or cellular gene expression
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus ovarian cancer tissue. Genes may be turned on or turned off in a particular
state, relative to another state, thus permitting comparison of two or more states. A
qualitatively regulated gene will exhibit an expression pattern within a state or cell type
which is detectable by standard techniques. Some genes will be expressed in one state or cell
type, but not in both. Alternatively, the difference in expression may be quantitative, e.g., in
that expression is modulated, either up-regulated, resulting in an increased amount of
transcript, or down-regulated, resulting in a decreased amount of transcript. The degree to
which expression differs need only be large enough to quantify via standard characterization
techniques as outlined below, such as by use of Affymetrix GeneChip™ expression arrays.
See, e.g., Lockhart (1996) Nature Biotechnology 14:1675-1680. Other techniques include,
but are not limited to, quantitative reverse transcriptase PCR, northern analysis, and RNase
protection. As outlined above, preferably the change in expression (e.g., up-regulation or
down-regulation) is at least about 50%, more preferably at least about 100%, more preferably
at least about 150%, more preferably at least about 200%, with from 300 to at least 1000%
being especially preferred.
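
The preferred percentage changes listed above can be illustrated with a short calculation. The following Python sketch is an assumption-laden example rather than a prescribed method: the function names are hypothetical, the percent change is computed relative to the reference (e.g., normal tissue) level, and the "about" qualifiers are treated as exact thresholds. The text does not state how a down-regulation percentage is to be computed, so the same formula is applied in both directions.

```python
# Hypothetical helpers illustrating the percentage thresholds named in the text.
THRESHOLDS = (300.0, 200.0, 150.0, 100.0, 50.0)  # percent, highest first


def percent_change(reference_level, sample_level):
    """Absolute change in expression as a percentage of the reference level."""
    if reference_level <= 0:
        raise ValueError("reference_level must be positive")
    return abs(sample_level - reference_level) / reference_level * 100.0


def highest_threshold_met(change_percent):
    """Return the largest listed threshold reached, or None if below 50%."""
    for threshold in THRESHOLDS:
        if change_percent >= threshold:
            return threshold
    return None


if __name__ == "__main__":
    change = percent_change(100.0, 250.0)   # transcript level rises from 100 to 250
    print(change, highest_threshold_met(change))
```

Under this convention a complete loss of transcript corresponds to a 100% change, so the higher thresholds can only be reached by up-regulation; a fold-change convention would behave differently.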

Evaluation may be at the gene transcript or the protein level. The amount of gene
expression may be monitored using nucleic acid probes to the DNA or RNA equivalent of the
gene transcript, and the quantification of gene expression levels, or, alternatively, the final
gene product itself (protein) can be monitored, e.g., with antibodies to the ovarian cancer
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to ovarian
cancer genes, e.g., those identified as being important in an ovarian cancer or disease
phenotype, can be evaluated in an ovarian disease diagnostic test. In a preferred
embodiment, gene expression monitoring is performed simultaneously on a number of genes.
Protein expression monitoring can likewise be performed on multiple proteins simultaneously
or on an individual basis.

In this embodiment, the ovarian cancer nucleic acid probes are attached to biochips as
outlined herein for the detection and quantification of ovarian cancer sequences in a
particular sample. The assays are further described below in the example. PCR techniques
can be used to provide greater sensitivity.
In a preferred embodiment nucleic acids encoding the ovarian cancer protein are
detected. Although DNA or RNA encoding the ovarian cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding an ovarian cancer protein is
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is
complementary to and hybridizes with the mRNA and includes, but is not limited to,
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be
examined on a solid support such as nylon membranes and hybridizing the probe with the
sample. Following washing to remove the non-specifically bound probe, the label is
detected. In another method detection of the mRNA is performed in situ. In this method
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following
washing to remove the non-specifically bound probe, the label is detected. For example, a
digoxigenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding
an ovarian cancer protein is detected by binding the digoxigenin with an anti-digoxigenin
secondary antibody and developed with nitro blue tetrazolium and
5-bromo-4-chloro-3-indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins as
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic
assays. The ovarian cancer proteins, antibodies, nucleic acids, modified proteins and cells
containing ovarian cancer sequences are used in diagnostic assays. This can be performed on
an individual gene or corresponding polypeptide level. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes and/or corresponding
polypeptides.

As described and defined herein, ovarian cancer proteins, including intracellular,
transmembrane, or secreted proteins, find use as prognostic or diagnostic markers of ovarian
disease. Detection of these proteins in putative ovarian cancer tissue allows for detection,
diagnosis, or prognosis of ovarian disease, and for selection of therapeutic strategy. In one
embodiment, antibodies are used to detect ovarian cancer proteins. A preferred method
separates proteins from a sample by electrophoresis on a gel (typically a denaturing and
reducing protein gel, but may be another type of gel, including isoelectric focusing gels and
the like). Following separation of proteins, the ovarian cancer protein is detected, e.g., by
immunoblotting with antibodies raised against the ovarian cancer protein. Methods of
immunoblotting are well known to those of ordinary skill in the art.

In another preferred method, antibodies to the ovarian cancer protein find use in in
situ imaging techniques, e.g., in histology. See, e.g., Asai (ed. 1993) Methods in Cell
Biology: Antibodies in Cell Biology (vol. 37) Academic Press. Cells are contacted with from
one to many antibodies to the ovarian cancer protein(s). Following washing to remove
non-specific antibody binding, the presence of the antibody or antibodies is detected. In one
embodiment the antibody is detected by incubating with a secondary antibody that contains a
detectable label. In another method the primary antibody to the ovarian cancer protein(s)
contains a detectable label, e.g., an enzyme marker that can act on a substrate. In another
preferred embodiment each one of multiple primary antibodies contains a distinct and
detectable label. This method finds particular use in simultaneous screening for a plurality of
ovarian cancer proteins. As will be appreciated by one of ordinary skill in the art, many other
histological imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the ability
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence
activated cell sorter (FACS) can be used in the method.

In another preferred embodiment, antibodies find use in diagnosing ovarian cancer
from blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as
samples to be probed or tested for the presence of ovarian cancer proteins. Antibodies can be
used to detect an ovarian cancer protein by previously described immunoassay techniques
including ELISA, immunoblotting (western blotting), immunoprecipitation, BIACORE
technology, and the like. Conversely, the presence of antibodies may indicate an immune
response against an endogenous ovarian cancer protein.

In a preferred embodiment, in situ hybridization of labeled ovarian cancer nucleic
acid probes to tissue arrays is done. For example, arrays of tissue samples, including ovarian
cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra)
is then performed. When comparing the fingerprints between an individual and a standard,
the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It
is further understood that the genes which indicate the diagnosis may differ from those which
indicate the prognosis, and molecular profiling of the condition of the cells may lead to
distinctions between responsive or refractory conditions or may be predictive of outcomes.
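
As one illustration of comparing an individual's expression fingerprint with a standard, the following minimal Python sketch assigns a sample to whichever reference profile it correlates with most strongly. The use of Pearson correlation, the function names, and the toy expression values are all assumptions made for this example; the text does not prescribe a particular comparison method, and real profiles would span many more genes.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length expression vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have the same length (at least 2 genes)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("a profile with identical values has no defined correlation")
    return cov / (spread_x * spread_y)


def closest_profile(sample, references):
    """Return the name of the reference profile most correlated with the sample."""
    return max(references, key=lambda name: pearson(sample, references[name]))


if __name__ == "__main__":
    # Hypothetical expression levels for the same four genes in each profile.
    references = {
        "normal": [1.0, 2.0, 8.0, 9.0],
        "ovarian cancer": [9.0, 8.0, 2.0, 1.0],
    }
    print(closest_profile([2.0, 2.0, 7.0, 9.0], references))
```

A correlation-based match is only a sketch of the comparison step; it says nothing about which genes drive the difference or about clinical thresholds.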
In a preferred embodiment, the ovarian cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing ovarian cancer sequences are used in prognosis assays.
As above, gene expression profiles can be generated that correlate to ovarian cancer, clinical,
pathological, or other information, in terms of long term prognosis. Again, this may be done
on either a protein or gene level, with the use of a plurality of genes being preferred. As
above, ovarian cancer probes may be attached to biochips for the detection and quantification
of ovarian cancer sequences in a tissue or patient. The assays proceed as outlined above for
diagnosis. PCR methods may provide more sensitive and accurate quantification.

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and antibodies as
described herein are used in drug screening assays. The ovarian cancer proteins, antibodies,
nucleic acids, modified proteins and cells containing ovarian cancer sequences are used in
drug screening assays or by evaluating the effect of drug candidates on a "gene expression
profile" or expression profile of polypeptides. In a preferred embodiment, the expression
profiles are used, preferably in conjunction with high throughput screening techniques to
allow monitoring for expression profile genes after treatment with a candidate agent. See,
e.g., Zlokarnik, et al. (1998) Science 279:84-88; and Heid (1996) Genome Res. 6:986-994.

In a preferred embodiment, the ovarian cancer proteins, antibodies, nucleic acids,
modified proteins and cells containing the native or modified ovarian cancer proteins are used
in screening assays. That is, the present invention provides novel methods for screening for
compositions which modulate the ovarian cancer phenotype or an identified physiological
function of an ovarian cancer protein. As above, this can be done on an individual gene level
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent. See, e.g., Zlokarnik, supra.

Having identified the differentially expressed genes herein, a variety of assays may be
executed. In a preferred embodiment, assays may be run on an individual gene or protein
level. That is, having identified a particular gene as up regulated in ovarian cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the
ovarian cancer protein. "Modulation" thus includes both an increase and a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal versus tissue undergoing ovarian cancer, with changes of at least
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or
greater. Thus, if a gene exhibits a 4-fold increase in ovarian cancer tissue compared to
normal tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in
ovarian cancer tissue compared to normal tissue often provides a target value of a 10-fold
increase in expression to be induced by the test compound.
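
By way of illustration only, the relationship between the observed change in expression and the target amount of modulation can be sketched as follows. The function names and the convention of expressing the change as a ratio of cancer to normal expression are assumptions made for this example and are not part of the described methods.

```python
def target_fold_change(observed_fold_change: float) -> float:
    """Fold change a test compound would need to induce to offset the observed change.

    observed_fold_change is the ratio of expression in ovarian cancer tissue to
    expression in normal tissue (4.0 means a 4-fold increase, 0.1 a 10-fold decrease).
    """
    if observed_fold_change <= 0:
        raise ValueError("fold change must be positive")
    return 1.0 / observed_fold_change


def percent_change(untreated: float, treated: float) -> float:
    """Percent change in expression after treatment, relative to the untreated level."""
    if untreated <= 0:
        raise ValueError("untreated expression must be positive")
    return 100.0 * (treated - untreated) / untreated


if __name__ == "__main__":
    # A gene that is 4-fold up in cancer tissue calls for about a 4-fold decrease.
    print(target_fold_change(4.0))
    # A gene that is 10-fold down in cancer tissue calls for about a 10-fold increase.
    print(target_fold_change(0.1))
    # A compound raising expression from 100 to 150 units produces a 50% change.
    print(percent_change(100.0, 150.0))
```

A percent change computed this way can then be compared against the thresholds recited above (at least 10%, preferably 50%, and so on).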

The amount of gene expression may be monitored using nucleic acid probes and the
quantification of gene expression levels, or, alternatively, the gene product itself can be
monitored, e.g., through the use of antibodies to the ovarian cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, gene expression or protein monitoring of a number of
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will typically
involve a plurality of those entities described herein.

In this embodiment, the ovarian cancer nucleic acid probes are attached to biochips as
outlined herein for the detection and quantification of ovarian cancer sequences in a
particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates, may
be used with dispensed primers in desired wells. A PCR reaction can then be performed and
analyzed for each well.

Expression monitoring can be performed to identify compounds that modify the
expression of one or more ovarian cancer-associated sequences, e.g., a polynucleotide
sequence set out in Tables 1-26. Generally, in a preferred embodiment, a test modulator is
added to the cells prior to analysis. Moreover, screens are also provided to identify agents
that modulate ovarian cancer, modulate ovarian cancer proteins, bind to an ovarian cancer
protein, or interfere with the binding of an ovarian cancer protein and an antibody or other
binding partner.

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the ovarian cancer phenotype or the expression of an ovarian cancer sequence,
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter
expression profiles, or expression profile nucleic acids or proteins provided herein. In one
embodiment, the modulator suppresses an ovarian cancer phenotype, e.g., to a normal or
non-malignant tissue fingerprint. In another embodiment, a modulator induces an ovarian cancer
phenotype. Generally, a plurality of assay mixtures are run in parallel with different agent
concentrations to obtain a differential response to the various concentrations. Typically, one
of these concentrations serves as a negative control, e.g., at zero concentration or below the
level of detection.
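
By way of illustration only, a differential response across parallel assay mixtures might be tabulated as in the following sketch. The data layout (a mapping from agent concentration to measured signal) and the function name are assumptions for this example. The zero-concentration mixture is treated as the negative control.

```python
def differential_response(readouts: dict[float, float]) -> dict[float, float]:
    """Express each assay readout relative to the negative control.

    readouts maps agent concentration (arbitrary units) to measured signal.
    The entry at concentration 0.0 is taken as the negative control.
    """
    if 0.0 not in readouts:
        raise ValueError("a zero-concentration negative control is required")
    control = readouts[0.0]
    if control == 0:
        raise ValueError("negative control signal must be non-zero")
    # Sort by concentration so the response reads as a dose series.
    return {conc: signal / control for conc, signal in sorted(readouts.items())}


if __name__ == "__main__":
    # Hypothetical signals from four assay mixtures run in parallel.
    series = {10.0: 50.0, 0.0: 200.0, 1.0: 100.0, 0.1: 180.0}
    for conc, relative in differential_response(series).items():
        print(f"concentration {conc}: {relative:.2f} of control")
```

Values below 1 indicate suppression relative to the control and values above 1 indicate induction.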

Drug candidates encompass numerous chemical classes, though typically they are
organic molecules, preferably small organic compounds having a molecular weight of more
than 100 and less than about 2,500 daltons. Preferred small molecules are less than 2000, or
less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise functional
groups necessary for structural interaction with proteins, particularly hydrogen bonding, and
typically include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least
two of the functional chemical groups. The candidate agents often comprise cyclical carbon
or heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or
more of the above functional groups. Candidate agents are also found among biomolecules
including peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives,
structural analogs or combinations thereof. Particularly preferred are peptides.

In one aspect, a modulator will neutralize the effect of an ovarian cancer protein. By
"neutralize" is meant that activity of a protein is inhibited or blocked, along with the
consequent effect on the cell.

In certain embodiments, combinatorial libraries of potential modulators will be
screened for an ability to bind to an ovarian cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve providing a
library containing a large number of potential therapeutic compounds (candidate
compounds). Such "combinatorial chemical libraries" are then screened in one or more
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical compounds
generated by either chemical synthesis or biological synthesis by combining a number of
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical
building blocks called amino acids in every possible way for a given compound length (e.g.,
the number of amino acids in a polypeptide compound). Millions of chemical compounds
can be synthesized through such combinatorial mixing of chemical building blocks. See, e.g.,
Gallop, et al. (1994) J. Med. Chem. 37:1233-1251.
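
By way of illustration only, the size of a linear combinatorial library built from a fixed set of building blocks, and the enumeration of its members, can be sketched as follows. The use of the twenty standard amino acids in one-letter code and the function names are assumptions for this example.

```python
from itertools import product
from typing import Iterator

# The twenty standard amino acids in one-letter code (assumed building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length: int, n_blocks: int = len(AMINO_ACIDS)) -> int:
    """Number of distinct linear compounds of the given length."""
    return n_blocks ** length


def enumerate_library(length: int, blocks: str = AMINO_ACIDS) -> Iterator[str]:
    """Lazily yield every sequence of the given length, one building block per position."""
    for combination in product(blocks, repeat=length):
        yield "".join(combination)


if __name__ == "__main__":
    # Pentapeptides already number in the millions.
    print(library_size(5))
    # Dipeptides are few enough to list in full.
    print(sum(1 for _ in enumerate_library(2)))
```

Because the library grows as the number of building blocks raised to the compound length, the enumeration is written as a generator so that members are produced one at a time instead of being held in memory.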

Preparation and screening of combinatorial chemical libraries is well known to those
of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175; Furka (1991) Pept. Prot. Res.
37:487-493; and Houghton, et al. (1991) Nature 354:84-88), peptoids (PCT Publication No. WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Nat'l Acad. Sci. USA
90:6909-913), vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc.
114:6568-570), non-peptidal peptidomimetics with a Beta-D-Glucose scaffolding
(Hirschmann, et al. (1992) J. Amer. Chem. Soc. 114:9217-218), analogous organic syntheses
of small compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661-662),
oligocarbamates (Cho, et al. (1993) Science 261:1303-305), and/or peptidyl phosphonates
(Campbell, et al. (1994) J. Org. Chem. 59:658-xxx). See, generally, Gordon, et al. (1994) J.
Med. Chem. 37:1385-401, nucleic acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic
acid libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn, et
al. (1996) Nature Biotechnology 14:309-314; and PCT/US96/10287), carbohydrate libraries
(see, e.g., Liang, et al. (1996) Science 274:1520-1522; and U.S. Patent No. 5,593,853), and
small organic molecule libraries (see, e.g., benzodiazepines, page 33, Baum (Jan. 18, 1993)
C&E News; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones,
U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134;
morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No.
5,288,514; and the like).

Devices for the preparation of combinatorial libraries are commercially available.
See, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony, Rainin,
Woburn, MA; 433A Applied Biosystems, Foster City, CA; 9050 Plus, Millipore, Bedford,
MA.

A number of well known robotic systems have also been developed for solution phase
chemistries. These systems include automated workstations like the automated synthesis
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, MA; Orca,
Hewlett-Packard, Palo Alto, CA), which mimic the manual synthetic operations performed by
a chemist. Any of the above devices are suitable for use with the present invention. The
nature and implementation of modifications to these devices (if any) so that they can operate
as discussed herein will be apparent to persons skilled in the relevant art. In addition,
numerous combinatorial libraries are themselves commercially available (see, e.g.,
ComGenex, Princeton, N.J.; Asinex, Moscow, RU; Tripos, Inc., St. Louis, MO; ChemStar,
Ltd, Moscow, RU; 3D Pharmaceuticals, Exton, PA; Martek Biosciences, Columbia, MD;
etc.).

The assays to identify modulators are amenable to high throughput screening.
Preferred assays thus detect enhancement or inhibition of ovarian cancer gene transcription,
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of
polypeptide activity.

High throughput assays for the presence, absence, quantification, or other properties
of particular nucleic acids or protein products are well known to those of skill in the art.
Binding assays and reporter gene assays are similarly well known. Thus, e.g., U.S.
Patent No. 5,559,410 discloses high throughput screening methods for proteins, U.S. Patent
No. 5,585,639 discloses high throughput screening methods for nucleic acid binding (e.g., in
arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of
screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available (see, e.g.,
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems
typically automate entire procedures, including all sample and reagent pipetting, liquid
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate
for the assay. These configurable systems provide high throughput and rapid start up as well
as a high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring proteins or
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or
random or directed digests of proteinaceous cellular extracts, may be used. In this way
libraries of proteins may be made for screening in the methods of the invention. Particularly
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins,
with the latter being preferred, and human proteins being especially preferred. Particularly
useful test compounds will be directed to the class of proteins to which the target belongs, e.g.,
substrates for enzymes or ligands and receptors.

In a preferred embodiment, modulators are peptides of from about 5 to about 30
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to
about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins as is outlined above, random peptides, or "biased" random peptides. By
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide
consists of essentially random nucleotides and amino acids, respectively. Since generally
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they
may incorporate any nucleotide or amino acid at any position. The synthetic process can be
designed to generate randomized proteins or nucleic acids, to allow the formation of all or
most of the possible combinations over the length of the sequence, thus forming a library of
randomized candidate bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence preferences or
constants at any position. In a preferred embodiment, the library is biased. That is, some
positions within the sequence are either held constant, or are selected from a limited number
of possibilities. For example, in a preferred embodiment, the nucleotides or amino acid
residues are randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic
residues, sterically biased (either small or large) residues, towards the creation of nucleic acid
binding domains, the creation of cysteines for cross-linking, prolines for SH-3 domains,
serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc.
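
By way of illustration only, the distinction between a fully randomized library member and a biased one can be sketched as follows. The per-position template representation, the particular grouping of residues labeled hydrophobic, and the function names are assumptions for this example.

```python
import random

# The twenty standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# An illustrative grouping of hydrophobic residues (an assumption; groupings vary).
HYDROPHOBIC = "AVILMFWY"


def random_peptide(length: int, rng: random.Random) -> str:
    """Fully randomized peptide: any amino acid may occur at any position."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def biased_peptide(template: list[str], rng: random.Random) -> str:
    """Biased peptide: each template entry lists the residues allowed at that position.

    A single-letter entry holds the position constant; a longer entry restricts
    the position to a limited class of residues.
    """
    return "".join(rng.choice(allowed) for allowed in template)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so the sketch is reproducible
    print(random_peptide(7, rng))
    # N-terminal cysteine held constant, two hydrophobic positions,
    # three unrestricted positions, and a constant proline.
    template = ["C", HYDROPHOBIC, HYDROPHOBIC, AMINO_ACIDS, AMINO_ACIDS, AMINO_ACIDS, "P"]
    print(biased_peptide(template, rng))
```

The same template approach applies to nucleic acids by substituting a nucleotide alphabet for the amino acid alphabet.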

Modulators of ovarian cancer can also be nucleic acids, as defined above.

As described above generally for proteins, nucleic acid modulating agents may be
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids.
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above
for proteins.

In a preferred embodiment, the candidate compounds are organic chemical moieties, a
wide variety of which are available in the literature.

After the candidate agent has been added and the cells allowed to incubate for some
period of time, the sample containing a target sequence to be analyzed is added to the
biochip. If required, the target sequence is prepared using known techniques. For example,
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc.,
with purification and/or amplification such as PCR performed as appropriate. For example,
an in vitro transcription with labels covalently attached to the nucleotides is performed.
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the
target sequence's specific binding to a probe. The label also can be an enzyme, such as
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate
substrate produces a product that can be detected. Alternatively, the label can be a labeled
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or
altered by the enzyme. The label also can be a moiety or compound, such as an epitope tag
or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin
is labeled as described above, thereby providing a detectable signal for the bound target
sequence. Unbound labeled streptavidin is typically removed prior to analysis.

As will be appreciated by those in the art, these assays can be direct hybridization
assays or can comprise "sandwich assays", which include the use of multiple probes, as is
generally outlined in U.S. Patent Nos. 5,681,702; 5,597,909; 5,545,730; 5,594,117;
5,591,584; 5,571,670; 5,580,731; 5,571,670; 5,591,584; 5,624,802; 5,635,352; 5,594,118;
5,359,100; 5,124,246; and 5,681,697, each of which is hereby incorporated by reference. In
this embodiment, in general, the target nucleic acid is prepared as outlined above, and then
added to the biochip comprising a plurality of nucleic acid probes, under conditions that
allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention, including
high, moderate and low stringency conditions as outlined above. The assays are generally
run under stringency conditions which allow formation of the label probe hybridization
complex only in the presence of target. Stringency can be controlled by altering a step
parameter that is a thermodynamic variable, including, but not limited to, temperature,
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic
solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is generally
outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at
higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways. Components
of the reaction may be added simultaneously, or sequentially, in different orders, with
preferred embodiments outlined below. In addition, the reaction may include a variety of
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.,
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.

Screens are performed to identify modulators of the ovarian cancer phenotype. In one
embodiment, screening is performed to identify modulators that can induce or suppress a
particular expression profile, thus preferably generating the associated phenotype. In another
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes
important in a particular state, screens can be performed to identify modulators that alter
expression of individual genes. In another embodiment, screening is performed to identify
modulators that alter a biological function of the expression product of a differentially
expressed gene. Again, having identified the importance of a gene in a particular state,
screens are performed to identify agents that bind and/or modulate the biological activity of
the gene product.

In addition screens can be done for genes that are induced in response to a candidate
agent. After identifying a modulator based upon its ability to suppress an ovarian cancer
expression pattern leading to a normal expression pattern, or to modulate a single ovarian
cancer gene expression profile so as to mimic the expression of the gene from normal tissue,
a screen as described above can be performed to identify genes that are specifically
modulated in response to the agent. Comparing expression profiles between normal tissue
and agent treated ovarian cancer tissue reveals genes that are not expressed in normal tissue
or ovarian cancer tissue, but are expressed in agent treated tissue. These agent-specific
sequences can be identified and used by methods described herein for ovarian cancer genes or
proteins. In particular these sequences and the proteins they encode find use in marking or
identifying agent treated cells. In addition, antibodies can be raised against the agent induced
proteins and used to target novel therapeutics to the treated ovarian cancer tissue sample.
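
By way of illustration only, the comparison that identifies agent-specific sequences can be sketched as a set operation. The detection threshold, the gene names, and the function names are assumptions for this example.

```python
def expressed_genes(levels: dict[str, float], threshold: float = 1.0) -> set[str]:
    """Genes whose measured level meets a detection threshold (threshold is an assumption)."""
    return {gene for gene, level in levels.items() if level >= threshold}


def agent_specific_genes(
    normal: set[str], cancer: set[str], treated: set[str]
) -> set[str]:
    """Genes expressed in agent treated tissue but in neither normal nor cancer tissue."""
    return treated - (normal | cancer)


if __name__ == "__main__":
    # Hypothetical expression levels in arbitrary units.
    normal = expressed_genes({"g1": 5.0, "g2": 8.0, "g4": 0.1})
    cancer = expressed_genes({"g2": 9.0, "g3": 6.0, "g4": 0.2})
    treated = expressed_genes({"g2": 7.0, "g4": 4.0})
    print(sorted(agent_specific_genes(normal, cancer, treated)))
```

Here only the gene detected solely after treatment is reported as agent-specific.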

Thus, in one embodiment, a test compound is administered to a population of ovarian
cancer cells that have an associated ovarian cancer expression profile. By "administration"
or "contacting" herein is meant that the candidate agent is added to the cells in such a manner
as to allow the agent to act upon the cell, whether by uptake and intracellular action, or by
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous
candidate agent (e.g., a peptide) may be put into a viral construct such as an adenoviral or
retroviral construct, and added to the cell, such that expression of the peptide agent is
accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used.

Once the test compound has been administered to the cells, the cells can be washed if
desired and are allowed to incubate under preferably physiological conditions for some
period of time. The cells are then harvested and a new gene expression profile is generated,
as outlined herein.

Thus, e.g., ovarian cancer or non-malignant tissue may be screened for agents that
modulate, e.g., induce or suppress the ovarian cancer phenotype. A change in at least one
gene, preferably many, of the expression profile indicates that the agent has an effect on
ovarian cancer activity. By defining such a signature for the ovarian cancer phenotype,
screens for new drugs that alter the phenotype can be devised. With this approach, the drug
target need not be known and need not be represented in the original expression screening
platform, nor does the level of transcript for the target protein need to change.

In a preferred embodiment, as outlined above, screens may be done on individual
genes and gene products (proteins). That is, having identified a particular differentially
expressed gene as important in a particular state, screening of modulators of either the
expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "ovarian cancer proteins"
or an "ovarian cancer modulatory protein". The ovarian cancer modulatory protein may be a
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic
acids of the Tables. Preferably, the ovarian cancer modulatory protein is a fragment. In a
preferred embodiment, the ovarian cancer amino acid sequence which is used to determine
sequence identity or similarity is encoded by a nucleic acid of the Tables. In another
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a
nucleic acid of the Tables. In another embodiment, the sequences are sequence variants as
further described herein.

Preferably, the ovarian cancer modulatory protein is a fragment of approximately 14
to 24 amino acids long. More preferably the fragment is a soluble fragment. Preferably, the
fragment includes a non-transmembrane region. In a preferred embodiment, the fragment has
an N-terminal Cys to aid in solubility. In another embodiment, the C-terminus of the
fragment is kept as a free acid and the N-terminus is a free amine to aid in coupling, e.g., to
cysteine. Or, the ovarian cancer proteins are conjugated to an immunogenic agent, e.g., to
BSA.

Measurements of ovarian cancer polypeptide activity, or of ovarian cancer or the
ovarian cancer phenotype can be performed using a variety of assays. For example, the
effects of the test compounds upon the function of the ovarian cancer polypeptides can be
measured by examining parameters described above. A suitable physiological change that
affects activity can be used to assess the influence of a test compound on the polypeptides of
this invention. When the functional consequences are determined using intact cells or
animals, one can also measure a variety of effects such as, in the case of ovarian cancer
associated with tumors, tumor growth, tumor metastasis, neovascularization, hormone
release, transcriptional changes to both known and uncharacterized genetic markers (e.g.,
northern blots), changes in cell metabolism such as cell growth or pH changes, and changes
in intracellular second messengers such as cGMP. In the assays of the invention, mammalian
ovarian cancer polypeptide is typically used, e.g., mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in vitro.
For example, an ovarian cancer polypeptide is first contacted with a potential modulator and
incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, the
ovarian cancer polypeptide levels are determined in vitro by measuring the level of protein or
mRNA. The level of protein is measured using immunoassays such as western blotting,
ELISA and the like with an antibody that selectively binds to the ovarian cancer polypeptide
or a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are
preferred. The level of protein or mRNA is detected using directly or indirectly labeled
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or
enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using the ovarian cancer protein
promoter operably linked to a reporter gene such as luciferase, green fluorescent protein,
CAT, or β-gal. The reporter construct is typically transfected into a cell. After treatment
with a potential modulator, the amount of reporter gene transcription, translation, or activity
is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on individual
genes and gene products (proteins). That is, having identified a particular differentially
expressed gene as important in a particular state, screening of modulators of the expression of
the gene or the gene product itself can be done. The gene products of differentially expressed
genes are sometimes referred to herein as "ovarian cancer proteins." The ovarian cancer
protein may be a fragment, or alternatively, be the full length protein to a fragment shown
herein.

In one embodiment, screening for modulators of expression of specific genes is
performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or isolated
gene product is used; that is, the gene products of one or more differentially expressed
nucleic acids are made. For example, antibodies are generated to the protein gene products,
and standard immunoassays are run to determine the amount of protein present.
Alternatively, cells comprising the ovarian cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining an ovarian cancer
protein and a candidate compound, and determining the binding of the compound to the
ovarian cancer protein. Preferred embodiments utilize the human ovarian cancer protein,
although other mammalian proteins, e.g., counterparts, may also be used, e.g., for the
development of animal models of human disease. In some embodiments, as outlined herein,
variant or derivative ovarian cancer proteins may be used.

Generally, in a preferred embodiment of the methods herein, the ovarian cancer
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated
sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be
made of any composition to which the compositions can be bound, is readily separated from
soluble material, and is otherwise compatible with the overall method of screening. The
surface of such supports may be solid or porous and of any convenient shape. Examples of
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose,
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number
of assays can be carried out simultaneously, using small amounts of reagents and samples.
The particular manner of binding of the composition is not crucial so long as it is compatible
with the reagents and overall methods of the invention, maintains the activity of the
composition and is non-diffusible. Preferred methods of binding include the use of
antibodies (which do not sterically block either the ligand binding site or activation sequence
when the protein is bound to the support), direct binding to "sticky" or ionic supports,
chemical crosslinking, the synthesis of the protein or agent on the surface, etc. Following
binding of the protein or agent, excess unbound material is removed by washing. The sample
receiving areas may then be blocked through incubation with bovine serum albumin (BSA),
casein or other innocuous protein or other moiety.

In a preferred embodiment, the ovarian cancer protein is bound to the support, and a
test compound is added to the assay. Alternatively, the candidate agent is bound to the
support and the ovarian cancer protein is added. Novel binding agents include specific
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for
human cells. A wide variety of assays may be used for this purpose, including labeled in
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for
protein binding, functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the ovarian
cancer protein may be done in a number of ways. In a preferred embodiment, the compound
is labeled, and binding determined directly, e.g., by attaching all or a portion of the ovarian
cancer protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label),
washing off excess reagent, and determining whether the label is present on the solid support.
Various blocking and washing steps may be utilized as appropriate.

In some embodiments, only one of the components is labeled, e.g., the proteins (or
proteinaceous candidate compounds) can be labeled. Alternatively, more than one
component can be labeled with different labels, e.g., 125I for the proteins and a fluorophor
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by competitive
binding assay. The competitor is a binding moiety known to bind to the target molecule (e.g.,
an ovarian cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under
certain circumstances, there may be competitive binding between the compound and the
binding moiety, with the binding moiety displacing the compound. In one embodiment, the
test compound is labeled. Either the compound, or the competitor, or both, is added first to
the protein for a time sufficient to allow binding, if present. Incubations may be performed at
a temperature which facilitates optimal activity, typically 4-40° C. Incubation periods are
typically optimized, e.g., to facilitate rapid high throughput screening. Typically between 0.1
and 1 hr will be sufficient. Excess reagent is generally removed or washed away. The
second component is then added, and the presence or absence of the labeled component is
followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the test
compound. Displacement of the competitor is an indication that the test compound is binding
to the ovarian cancer protein and thus is capable of binding to, and potentially modulating,
the activity of the ovarian cancer protein. In this embodiment, either component can be
labeled. Thus, e.g., if the competitor is labeled, the presence of label in the wash solution
indicates displacement by the agent. Alternatively, if the test compound is labeled, the
presence of the label on the support indicates displacement.

In an alternative embodiment, the test compound is added first, with incubation and
washing, followed by the competitor. The absence of binding by the competitor may indicate
that the test compound is bound to the ovarian cancer protein with a higher affinity. Thus, if
the test compound is labeled, the presence of the label on the support, coupled with a lack of
competitor binding, may indicate that the test compound is capable of binding to the ovarian
cancer protein.

In a preferred embodiment, the methods comprise differential screening to identify
agents that are capable of modulating the activity of the ovarian cancer proteins. In this
embodiment, the methods comprise combining an ovarian cancer protein and a competitor in
a first sample. A second sample comprises a test compound, an ovarian cancer protein, and a
competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples indicates the presence of an agent capable of
binding to the ovarian cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the ovarian cancer protein.
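
The two-sample comparison described above can be sketched numerically. The following is a minimal illustration only, assuming competitor binding has been reduced to a single signal value per sample (e.g., counts of labeled competitor retained on the support). The function names and the 0.5 cutoff are hypothetical and are not taken from the specification.

```python
def relative_change(first_sample_signal, second_sample_signal):
    """Relative change in competitor binding between the two samples.

    first_sample_signal:  competitor binding with protein + competitor only
    second_sample_signal: competitor binding with protein + competitor + test compound
    """
    if first_sample_signal <= 0:
        raise ValueError("first-sample signal must be positive")
    return abs(second_sample_signal - first_sample_signal) / first_sample_signal


def indicates_binding(first_sample_signal, second_sample_signal, cutoff=0.5):
    """Flag a test compound when competitor binding differs by at least `cutoff`.

    The cutoff is an illustrative assumption; a real screen would set it
    from the variability of the controls.
    """
    return relative_change(first_sample_signal, second_sample_signal) >= cutoff


# Hypothetical readings: the competitor signal falls from 12000 to 3000
# when the test compound is present.
print(relative_change(12000, 3000))
print(indicates_binding(12000, 3000))
```

A difference in either direction is reported, since the text refers only to "a change, or difference" in competitor binding.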

Alternatively, differential screening is used to identify drug candidates that bind to the
native ovarian cancer protein, but cannot bind to modified ovarian cancer proteins. The
structure of the ovarian cancer protein may be modeled, and used in rational drug design to
synthesize agents that interact with that site. Drug candidates that affect the activity of an
ovarian cancer protein are also identified by screening drugs for the ability to either enhance
or reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably control
and test samples are performed in at least triplicate to obtain statistically significant results.
Incubation of all samples is for a time sufficient for the binding of the agent to the protein.
Following incubation, samples are washed free of non-specifically bound material and the
amount of bound, generally labeled agent determined. For example, where a radiolabel is
employed, the samples may be counted in a scintillation counter to determine the amount of
bound compound.
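
As an illustration of how triplicate counts might be reduced to a single value, the sketch below averages replicate scintillation counts and subtracts the negative-control background. It is a hedged example, not a prescribed analysis: the function names and the sample counts are hypothetical, and only the Python standard library is used.

```python
from statistics import mean, stdev


def summarize_replicates(counts):
    """Return (mean, sample standard deviation) for at least three replicates."""
    if len(counts) < 3:
        raise ValueError("at least triplicate measurements are expected")
    return mean(counts), stdev(counts)


def specific_binding(test_counts, negative_control_counts):
    """Mean test signal minus mean negative-control (background) signal."""
    test_mean, _ = summarize_replicates(test_counts)
    background_mean, _ = summarize_replicates(negative_control_counts)
    return test_mean - background_mean


# Hypothetical counts per minute for one labeled agent and its negative control.
test_cpm = [5200, 4900, 5100]
negative_cpm = [400, 450, 350]
print(summarize_replicates(test_cpm))
print(specific_binding(test_cpm, negative_cpm))
```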

A variety of other reagents may be included in the screening assays. These include
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to
facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture
of components may be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of an ovarian cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising ovarian cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes an ovarian cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence or previous or
subsequent exposure to physiological signals, e.g., hormones, antibodies, peptides, antigens,
cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In
another example, the determinations are made at different stages of the cell cycle
process.

In this way, compounds that modulate ovarian cancer agents are identified.
Compounds with pharmacological activity are able to enhance or interfere with the activity of
the ovarian cancer protein. Once identified, similar structures are evaluated to identify
critical structural features of the compound.

In one embodiment, a method of inhibiting ovarian cancer cell division is provided.
The method comprises administration of an ovarian cancer inhibitor. In another embodiment,
a method of inhibiting ovarian cancer is provided. The method comprises administration of
an ovarian cancer inhibitor. In a further embodiment, methods of treating cells or individuals
with ovarian cancer are provided. The method comprises administration of an ovarian cancer
inhibitor.

In one embodiment, an ovarian cancer inhibitor is an antibody as discussed above. In
another embodiment, the ovarian cancer inhibitor is an antisense or RNAi molecule.

A variety of cell viability, growth, proliferation, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of ovarian cancer sequences, which when expressed in host cells, inhibit
abnormal cellular proliferation and transformation. A therapeutic compound would reduce or
eliminate the host cells' ability to grow in stirred suspension culture or suspended in
semi-solid media, such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique (3d ed.)
Wiley-Liss, herein incorporated by reference. See also, the methods section of Garkavtsev, et
al. (1996), supra, herein incorporated by reference.

Contact inhibition and density limitation of growth

Normal cells typically grow in a flat and organized pattern in a petri dish until they
touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (3H)-thymidine at
saturation density can be used to measure density limitation of growth. See, e.g., Freshney
(1994), supra. The transformed cells, when transfected with tumor suppressor genes,
regenerate a normal phenotype and become contact inhibited and would grow to a lower
density.

In this assay, labeling index with (3H)-thymidine at saturation density is a preferred
method of measuring density limitation of growth. Transformed host cells are transfected
with an ovarian cancer-associated sequence and are grown for 24 hr at saturation density in
non-limiting medium conditions. The percentage of cells labeling with (3H)-thymidine is
determined autoradiographically. See, e.g., Freshney (1994), supra.
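
The labeling index referred to above is simply the percentage of cells scored as labeled. A minimal sketch of that arithmetic follows; the function name and the cell counts are hypothetical and serve only to show the calculation.

```python
def labeling_index(labeled_cells, total_cells):
    """Percentage of scored cells that incorporated the (3H)-thymidine label."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


# Hypothetical autoradiography scores at saturation density.
control_index = labeling_index(120, 400)      # transformed host cells
transfected_index = labeling_index(20, 400)   # cells transfected with a test sequence
print(control_index, transfected_index)
```

A lower index in the transfected cells than in the control would be consistent with restored density limitation of growth.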

Growth factor or serum dependence

Transformed cells typically have a lower serum dependence than their normal
counterparts. See, e.g., Temin (1966) J. Nat'l Cancer Inst. 37:167-175; Eagle, et al. (1970) J.
Exp. Med. 131:836-879; and Freshney, supra. This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host
cells can be compared with that of control.

Tumor specific marker levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells (see, e.g.,
Gullino, pp. 178-184 "Angiogenesis, tumor vascularization, and potential interference with
tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer Plenum). Similarly,
tumor angiogenesis factor (TAF) is released at a higher level in tumor cells than in their normal
counterparts. See, e.g., Folkman (1992) Sem Cancer Biol. 3:89-96.

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305;
Strickland and Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al. (1980) Br. J. Cancer
42:305-312; Gullino, pp. 178-184 "Angiogenesis, tumor vascularization, and potential
interference with tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer
Plenum; and Freshney (1985) Anticancer Res. 5:111-130.

Invasiveness into Matrigel

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate ovarian
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Alternatively, the level of invasion of host cells can be measured by using filters
coated with Matrigel or some other extracellular matrix constituent. Penetration into the gel,
or through to the distal side of the filter, is rated as invasiveness, and rated histologically by
number of cells and distance moved, or by pre-labeling the cells with 125I and counting the
radioactivity on the distal side of the filter or bottom of the dish. See, e.g., Freshney (1984),
supra.

Tumor growth in vivo 

Effects of ovarian cancer-associated sequences on cell growth can be tested in
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which
the ovarian cancer gene is disrupted or in which an ovarian cancer gene is inserted.
Knock-out transgenic mice can be made by insertion of a marker gene or other heterologous gene
into the endogenous ovarian cancer gene site in the mouse genome via homologous
recombination. Such mice can also be made by substituting the endogenous ovarian cancer
gene with a mutated version of the ovarian cancer gene, or by mutating the endogenous
ovarian cancer gene, e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. By breeding the chimeric
mice it is possible to obtain a new line of mice containing the introduced genetic lesion. See,
e.g., Capecchi, et al. (1989) Science 244:1288-1292. Chimeric targeted mice can be derived
according to Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory Manual
CSH Press; and Robertson (ed. 1987) Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach IRL Press, Washington, D.C.

Alternatively, various immune-suppressed or immune-deficient host animals can be
used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J.
Nat'l Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated
mouse (see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J.
Cancer 41:52-61) can be used as a host. Transplantable tumor cells (typically about 10^
cells) injected into isogenic hosts will produce invasive tumors in a high proportion of cases,
while normal cells of similar origin will not. In hosts which developed invasive tumors, cells
expressing an ovarian cancer-associated sequence are injected subcutaneously. After a
suitable length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or
by its two largest dimensions) and compared to the control. Tumors that have statistically
significant reduction (using, e.g., Student's t test) are said to have inhibited growth.
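
The measurement and comparison just described can be sketched as follows. This is an illustrative example under stated assumptions: the volume estimate V = (length x width^2)/2 is a commonly used caliper approximation and is not specified in the text, the pooled-variance form of Student's t statistic is written out by hand using only the standard library, and the function names are hypothetical.

```python
import math
from statistics import mean, variance


def tumor_volume(length_mm, width_mm):
    """Estimate tumor volume (mm^3) from its two largest dimensions.

    Uses the common approximation V = 0.5 * length * width**2, which is an
    assumption of this sketch rather than a formula given in the text.
    """
    return 0.5 * length_mm * width_mm ** 2


def student_t(control, treated):
    """Pooled-variance two-sample Student's t statistic and degrees of freedom."""
    n_c, n_t = len(control), len(treated)
    if n_c < 2 or n_t < 2:
        raise ValueError("each group needs at least two measurements")
    dof = n_c + n_t - 2
    pooled_var = ((n_c - 1) * variance(control) + (n_t - 1) * variance(treated)) / dof
    std_err = math.sqrt(pooled_var * (1.0 / n_c + 1.0 / n_t))
    return (mean(control) - mean(treated)) / std_err, dof


# Hypothetical caliper readings (length, width) in mm.
control_volumes = [tumor_volume(l, w) for l, w in [(12, 10), (13, 9), (11, 10)]]
treated_volumes = [tumor_volume(l, w) for l, w in [(8, 6), (7, 6), (9, 5)]]
t_stat, dof = student_t(control_volumes, treated_volumes)
print(t_stat, dof)
```

The returned statistic would then be compared against a t distribution with the stated degrees of freedom to judge whether the reduction is statistically significant.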

Polynucleotide modulators of ovarian cancer 

Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of an ovarian cancer-associated protein is
down-regulated, or entirely inhibited, by the use of an antisense polynucleotide, e.g., a nucleic acid
complementary to, and which can preferably hybridize specifically to, a coding mRNA
nucleic acid sequence, e.g., an ovarian cancer protein mRNA, or a subsequence thereof.
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability
of the mRNA.

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their
close homologs. Antisense polynucleotides may also have altered sugar moieties or
inter-sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing
species which are known for use in the art. Analogs are comprehended by this invention so
long as they function effectively to hybridize with the ovarian cancer protein mRNA. See,
e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant means,
or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors,
including Applied Biosystems. The preparation of other oligonucleotides such as
phosphorothioates and alkylated derivatives is also well known to those of skill in the art.

Antisense molecules as used herein include antisense or sense oligonucleotides.
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the
anti-sense strand. The antisense and sense oligonucleotides comprise a single-stranded nucleic
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA
(antisense) sequences for ovarian cancer molecules. A preferred antisense molecule is for an
ovarian cancer sequence in Tables 1-26, or for a ligand or activator thereof. Antisense or
sense oligonucleotides, according to the present invention, comprise a fragment generally at
least about 14 nucleotides, preferably from about 14 to 30 nucleotides. An antisense or a
sense oligonucleotide can be developed based upon a cDNA sequence encoding a given
protein. See, e.g., Stein and Cohen (1988) Cancer Res. 48:2659-2668; and van der Krol, et
al. (1988) BioTechniques 6:958-976.
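
To make the relationship between a cDNA sequence and an antisense oligonucleotide concrete, the sketch below derives the reverse complement of a 14 to 30 nucleotide window of a sense-strand DNA sequence. It is an illustration only: the function name and the example sequence are hypothetical, and no sequence from Tables 1-26 is used.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(cdna, start, length=20):
    """Return the DNA antisense oligonucleotide (5'->3') for a cDNA window.

    cdna   : sense-strand cDNA sequence (A, C, G, T)
    start  : 0-based index of the first targeted nucleotide
    length : oligonucleotide length, restricted here to the 14-30 nt range
    """
    if not 14 <= length <= 30:
        raise ValueError("length should be between 14 and 30 nucleotides")
    target = cdna[start:start + length].upper()
    if len(target) != length:
        raise ValueError("window extends past the end of the sequence")
    if set(target) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    # Complement each base, then reverse so the result reads 5'->3'.
    return target.translate(_COMPLEMENT)[::-1]


# Hypothetical 21-nt cDNA fragment used only for illustration.
example_cdna = "ATGGCCAAGCTTGACCTGAGT"
print(antisense_oligo(example_cdna, start=0, length=14))  # TCAAGCTTGGCCAT
```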

RNA interference is a mechanism to suppress gene expression in a sequence specific
manner. See, e.g., Brummelkamp, et al. (2002) Sciencexpress (21 March 2002); Sharp (1999)
Genes Dev. 13:139-141; and Carthew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature
411:494-498. The mechanism may be used to down-regulate expression levels of identified genes,
e.g., for treatment of, or validation of relevance to, disease.

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit
transcription of ovarian cancer-associated nucleotide sequences. A ribozyme is an RNA
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have
been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes,
RNase P, and axhead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. Pharmacol.
25:289-317 for a general review of the properties of different ribozymes).

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990)
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No.
5,254,678. Methods of preparing them are well known to those of skill in the art. See, e.g.,
WO 94/26877; Ojwang, et al. (1993) Proc. Nat'l Acad. Sci. USA 90:6340-6344; Yamada, et
al. (1994) Hum. Gene Ther. 1:39-45; Leavitt, et al. (1995) Proc. Nat'l Acad. Sci. USA
92:699-703; Leavitt, et al. (1994) Hum. Gene Ther. 5:1151-120; and Yamada, et al. (1994)
Virology 205:121-126.

Polynucleotide modulators of ovarian cancer may be introduced into a cell containing
the target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to,
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell
surface receptors. Preferably, conjugation of the ligand binding molecule does not
substantially interfere with the ability of the ligand binding molecule to bind to its
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of ovarian
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

Thus, in one embodiment, methods of modulating ovarian cancer in cells or
organisms are provided. In one embodiment, the methods comprise administering to a cell an
anti-ovarian cancer antibody that reduces or eliminates the biological activity of an
endogenous ovarian cancer protein. Alternatively, the methods comprise administering to a
cell or organism a recombinant nucleic acid encoding an ovarian cancer protein. This may be
accomplished in any number of ways. In a preferred embodiment, e.g., when the ovarian
cancer sequence is down-regulated in ovarian cancer, such state may be reversed by
increasing the amount of ovarian cancer gene product in the cell. This can be accomplished,
e.g., by over-expressing the endogenous ovarian cancer gene or administering a gene
encoding the ovarian cancer sequence, using known gene-therapy techniques. In a
preferred embodiment, the gene therapy techniques include the incorporation of the
exogenous gene using enhanced homologous recombination (EHR), e.g., as described in
PCT/US93/03868, hereby incorporated by reference in its entirety. Alternatively, e.g., when
the ovarian cancer sequence is up-regulated in ovarian cancer, the activity of the endogenous
ovarian cancer gene is decreased, e.g., by the administration of an ovarian cancer antisense or
RNAi nucleic acid.

In one embodiment, the ovarian cancer proteins of the present invention may be used
to generate polyclonal and monoclonal antibodies to ovarian cancer proteins. Similarly, the
ovarian cancer proteins can be coupled, using standard technology, to affinity
chromatography columns. These columns may then be used to purify ovarian cancer
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred
embodiment, the antibodies are generated to epitopes unique to an ovarian cancer protein;
that is, the antibodies show little or no cross-reactivity to other proteins. The ovarian cancer
antibodies may be coupled to standard affinity chromatography columns and used to purify
ovarian cancer proteins. The antibodies may also be used as blocking polypeptides, as
outlined above, since they will specifically bind to the ovarian cancer protein.

Methods of identifying variant ovarian cancer-associated sequences

Without being bound by theory, expression of various ovarian cancer sequences is
correlated with ovarian cancer. Accordingly, disorders based on mutant or variant ovarian
cancer genes may be determined. In one embodiment, the invention provides methods for
identifying cells containing variant ovarian cancer genes, e.g., determining all or part of the
sequence of at least one endogenous ovarian cancer gene in a cell. This may be
accomplished using any number of sequencing techniques. In a preferred embodiment, the
invention provides methods of identifying the ovarian cancer genotype of an individual, e.g.,
determining all or part of the sequence of at least one ovarian cancer gene of the individual.
This is generally done in at least one tissue of the individual, and may include the evaluation
of a number of tissues or different samples of the same tissue. The method may include
comparing the sequence of the sequenced ovarian cancer gene to a known ovarian cancer
gene, e.g., a wild-type gene.

The sequence of all or part of the ovarian cancer gene can then be compared to the
sequence of a known ovarian cancer gene to determine if any differences exist. This can be
done using any number of known homology programs, such as Bestfit, etc. In a preferred
embodiment, the presence of a difference in the sequence between the ovarian cancer gene of
the patient and the known ovarian cancer gene correlates with a disease state or a propensity
for a disease state, as outlined herein.
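
As a simplified illustration of the comparison step, the sketch below lists positions at which a sample sequence differs from a reference sequence. It assumes the two sequences have already been aligned and have equal length; it is not a substitute for an alignment program such as Bestfit, and the function name and sequences are hypothetical.

```python
def point_differences(reference, sample):
    """List (1-based position, reference base, sample base) for each mismatch.

    Both sequences are assumed to be pre-aligned and of equal length.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (position, ref_base, sample_base)
        for position, (ref_base, sample_base) in enumerate(zip(reference, sample), start=1)
        if ref_base != sample_base
    ]


# Hypothetical wild-type and patient fragments differing at one position.
wild_type = "ATGGCCAAG"
patient = "ATGACCAAG"
print(point_differences(wild_type, patient))  # [(4, 'G', 'A')]
```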

In a preferred embodiment, the ovarian cancer genes are used as probes to determine
the number of copies of the ovarian cancer gene in the genome.

In another preferred embodiment, the ovarian cancer genes are used as probes to
determine the chromosomal localization of the ovarian cancer genes. Information such as
chromosomal localization finds use in providing a diagnosis or prognosis, in particular when
chromosomal abnormalities such as translocations and the like are identified in the ovarian
cancer gene locus.

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of an ovarian cancer protein or
modulator thereof is administered to a patient. By "therapeutically effective dose" herein is
meant a dose that produces the effects for which it is administered. The exact dose will depend
on the purpose of the treatment, and will be ascertainable by one skilled in the art using
known techniques. See, e.g., Ansel, et al. (1999) Pharmaceutical Dosage Forms and Drug
Delivery Systems Lippincott; Lieberman (1992) Pharmaceutical Dosage Forms (vols. 1-3)
Dekker, ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art,
Science and Technology of Pharmaceutical Compounding Amer. Pharmaceutical Assn.; and
Pickar (1999) Dosage Calculations Thomson. Adjustments for ovarian cancer degradation,
systemic versus localized delivery, and rate of new protease synthesis, as well as the age,
body weight, general health, sex, diet, time of administration, drug interaction, and the
severity of the condition may be necessary, and will be ascertainable with routine
experimentation by those skilled in the art. U.S. Patent Application No. 09/687,576, which further
discloses the use of compositions and methods of diagnosis and treatment in ovarian cancer, is
hereby expressly incorporated by reference.

A "patient" for the purposes of the present invention includes both humans and other
animals, particularly mammals. Thus the methods are applicable to both human therapy and
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a
primate, and in the most preferred embodiment the patient is human.

The administration of the ovarian cancer proteins and modulators thereof of the
present invention can be done in a variety of ways as discussed above, including, but not
limited to, orally, subcutaneously, intravenously, intra-nasally, transdermally,
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In
some instances, e.g., in the treatment of wounds and inflammation, the ovarian cancer
proteins and modulators may be directly applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise an ovarian cancer
protein in a form suitable for administration to a patient. In the preferred embodiment, the
pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid, and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper,
manganese, aluminum salts and the like. Particularly preferred are the ammonium,
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines,
substituted amines including naturally occurring substituted amines, cyclic amines and basic
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine,
tripropylamine, and ethanolamine.

The pharmaceutical compositions may also include one or more of the following:
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose,
lactose, corn and other starches; binding agents; sweeteners and other flavoring agents;
coloring agents; and polyethylene glycol.

The pharmaceutical compositions can be administered in a variety of unit dosage
forms depending upon the method of administration. For example, unit dosage forms
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules,
and lozenges. It is recognized that ovarian cancer protein modulators (e.g., antibodies,
antisense constructs, ribozymes, small organic molecules, etc.), when administered orally,
should be protected from digestion. This is typically accomplished either by complexing the
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a
protection barrier. Means of protecting agents from digestion are well known in the art.
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The compositions for administration will commonly comprise an ovarian cancer 
protein modulator dissolved in a phannaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
5 sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the. like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 
sodium lactate and the like. The concentration of active agent in these formulations can vary 

1 0 widely, and will be selected primarily based on fluid volumes, viscosities, body weight, and 
the like in accordance with the particular mode of administration selected and the patient's 
needs. See, e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Hardman and 
Limbird (eds. 2001) Goodman and Gillman: The Pharmacological Basis of Therapeutics 
(10th ed.) McGraw-Hill. Thus, a typical pharmaceutical composition for intravenous 

15 administration would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to 

about 1 00 mg per patient per day may be used, particularly when the drug is administered to a 
secluded site and not into the blood stream, such as into a body cavity or into a lumen of an 
organ. Substantially higher dosages are possible in topical administration. Actual methods 
for preparing parenterally administrable compositions are readily available. 

The compositions containing modulators of ovarian cancer proteins can be
administered for therapeutic or prophylactic treatments. In therapeutic applications,
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an
amount sufficient to cure or at least partially arrest the disease and/or its complications. An
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts
effective for this use will depend upon the severity of the disease and the general state of the
patient's health. Single or multiple administrations of the compositions may be given
depending on the dosage and frequency as required and tolerated by the patient. In any event,
the composition should provide a sufficient quantity of the agents of this invention to
effectively treat the patient. An amount of modulator that is capable of preventing or slowing
the development of cancer in a mammal is referred to as a "prophylactically effective dose."
The particular dose required for a prophylactic treatment will depend upon the medical
condition and history of the mammal, the particular cancer being prevented, as well as other
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood
of developing cancer based, e.g., in part, upon gene expression profiles. Vaccine strategies
may be used, in either a DNA vaccine form, or protein vaccine.

It will be appreciated that the present ovarian cancer protein-modulating compounds
can be administered alone or in combination with additional ovarian cancer modulating
compounds or with other therapeutic agents, e.g., other anti-cancer agents or treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides
comprising nucleic acid sequences set forth in Tables 1-26, such as RNAi, antisense
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present
invention provides methods, reagents, vectors, and cells useful for expression of ovarian
cancer-associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo
(cell or organism-based) recombinant expression systems.

The particular procedure used to introduce the nucleic acids into a host cell for
expression of a protein or nucleic acid is application specific. Many procedures for
introducing foreign nucleotide sequences into host cells may be used. These include the use
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection,
plasmid vectors, viral vectors and any of the other well known methods for introducing cloned
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell. See,
e.g., Berger and Kimmel (1987) Guide to Molecular Cloning Techniques from Methods in
Enzymology (vol. 152) Academic Press; Ausubel, et al. (eds. 1999 and supplements) Current
Protocols Lippincott; and Sambrook, et al. (2001) Molecular Cloning: A Laboratory Manual
(3d ed., Vol. 1-3) CSH Press.

In a preferred embodiment, ovarian cancer proteins and modulators are administered
as therapeutic agents, and can be formulated as outlined above. Similarly, ovarian cancer
genes (including both the full-length sequence, partial sequences, or regulatory sequences of
the ovarian cancer coding regions) can be administered in a gene therapy application. These
ovarian cancer genes can include antisense applications, either as gene therapy (e.g., for
incorporation into the genome) or as antisense compositions, as will be appreciated by those
in the art.

Ovarian cancer polypeptides and polynucleotides can also be administered as vaccine
compositions to stimulate HTL, CTL, and antibody responses. Such vaccine compositions
can include, e.g., lipidated peptides (see, e.g., Vitiello, et al. (1995) J. Clin. Invest. 95:341-349),
peptide compositions encapsulated in poly(D,L-lactide-co-glycolide) ("PLG")
microspheres (see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al.
(1994) Vaccine 12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions
contained in immune stimulating complexes (ISCOMS; see, e.g., Takahashi, et al. (1990)
Nature 344:873-875; Hu, et al. (1998) Clin. Exp. Immunol. 113:235-243), multiple antigen
peptide systems (MAPs; see, e.g., Tam (1988) Proc. Nat'l Acad. Sci. USA 85:5409-5413;
Tam (1996) J. Immunol. Methods 196:17-32), peptides formulated as multivalent peptides;
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery
vectors (Perkus, et al., p. 379, in Kaufmann (ed. 1996) Concepts in Vaccine Development de
Gruyter; Chakrabarti, et al. (1986) Nature 320:535-537; Hu, et al. (1986) Nature 320:537-540;
Kieny, et al. (1986) AIDS Bio/Technology 4:790-795; Top, et al. (1971) J. Infect. Dis.
124:148-154; Chanda, et al. (1990) Virology 175:535-547), particles of viral or synthetic
origin (see, e.g., Kofler, et al. (1996) J. Immunol. Methods 192:25-35; Eldridge, et al. (1993)
Sem. Hematol. 30:16-24; Falo, et al. (1995) Nature Med. 7:649-653), adjuvants (Warren, et
al. (1986) Ann. Rev. Immunol. 4:369-388; Gupta, et al. (1993) Vaccine 11:293-306),
liposomes (Reddy, et al. (1992) J. Immunol. 148:1585-1589; Rock (1996) Immunol. Today
17:131-137), or, naked or particle absorbed cDNA (Ulmer, et al. (1993) Science 259:1745-1749;
Robinson, et al. (1993) Vaccine 11:957-960; Shiver, et al., p. 423, in Kaufmann (ed.
1996) Concepts in Vaccine Development de Gruyter; Cease and Berzofsky (1994) Ann. Rev.
Immunol. 12:923-989; and Eldridge, et al. (1993) Sem. Hematol. 30:16-24). Toxin-targeted
delivery technologies, also known as receptor mediated targeting, such as those of Avant
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used.

Vaccine compositions often include adjuvants. Many adjuvants contain a substance
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or
mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis, or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA
encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient.
See, e.g., Wolff, et al. (1990) Science 247:1465-1468; U.S. Patent Nos. 5,580,859; 5,589,466;
5,804,566; 5,739,118; 5,736,524; 5,679,647; and WO 98/04720. Examples of DNA-based
delivery technologies include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated)
delivery, cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated
delivery (see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the invention
can be expressed by viral or bacterial vectors. Examples of expression vectors include
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode ovarian cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover, et al. (1991) Nature 351:456-460. A wide variety of other vectors useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the
like, will be apparent. See, e.g., Shata, et al. (2000) Mol. Med. Today 6:66-71; Shedlock, et
al. (2000) J. Leukoc. Biol. 68:793-806; and Hipp, et al. (2000) In Vivo 14:571-85.

Methods for the use of genes as DNA vaccines are well known, and include placing
an ovarian cancer gene or portion of an ovarian cancer gene under the control of a regulatable
promoter or a tissue-specific promoter for expression in an ovarian cancer patient. The
ovarian cancer gene used for DNA vaccines can encode full-length ovarian cancer proteins,
but more preferably encodes portions of the ovarian cancer proteins including peptides
derived from the ovarian cancer protein. In one embodiment, a patient is immunized with a
DNA vaccine comprising a plurality of nucleotide sequences derived from an ovarian cancer
gene. For example, ovarian cancer-associated genes or sequences encoding subfragments of an
ovarian cancer protein are introduced into expression vectors and tested for their
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell
responses. This procedure provides for production of cytotoxic T cell responses against cells
which present antigen, including intracellular epitopes.

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase
the immunogenic response to the ovarian cancer polypeptide encoded by the DNA vaccine.
Additional or alternative adjuvants are available.

In another preferred embodiment ovarian cancer genes find use in generating animal
models of ovarian cancer. When the ovarian cancer gene identified is repressed or
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA is directed
to the ovarian cancer gene, will also diminish or repress expression of the gene. Animal
models of ovarian cancer find use in screening for modulators of an ovarian cancer-associated
sequence or modulators of ovarian cancer. Similarly, transgenic animal
technology including gene knockout technology, e.g., as a result of homologous
recombination with an appropriate gene targeting vector, will result in the absence or
increased expression of the ovarian cancer protein. When desired, tissue-specific expression
or knockout of the ovarian cancer protein may be necessary.

It is also possible that the ovarian cancer protein is overexpressed in ovarian cancer.
As such, transgenic animals can be generated that overexpress the ovarian cancer protein.
Depending on the desired expression level, promoters of various strengths can be employed
to express the transgene. Also, the number of copies of the integrated transgene can be
determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods find use as animal models of ovarian cancer and are
additionally useful in screening for modulators to treat ovarian cancer.

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are
also provided by the invention. In the diagnostic and research applications such kits may
include any or all of the following: assay reagents, buffers, ovarian cancer-specific nucleic
acids or antibodies, hybridization probes and/or primers, siRNA or antisense polynucleotides,
ribozymes, dominant negative ovarian cancer polypeptides or polynucleotides, small
molecule inhibitors of ovarian cancer-associated sequences, etc. A therapeutic product may
include sterile saline or another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing directions (e.g.,
protocols) for the practice of the methods of this invention. While the instructional materials
typically comprise written or printed materials, they are not limited to such. Any medium
capable of storing such instructions and communicating them to an end user is contemplated
by this invention. Such media include, but are not limited to, electronic storage media (e.g.,
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such
media may include addresses to internet sites that provide such instructional materials.

The present invention also provides for kits for screening for modulators of ovarian
cancer-associated sequences. Such kits can be prepared from readily available materials and
reagents. For example, such kits can comprise one or more of the following materials: an
ovarian cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for
testing ovarian cancer-associated activity. Optionally, the kit contains biologically active
ovarian cancer protein. A wide variety of kits and components can be prepared according to
the present invention, depending upon the intended user of the kit and the particular needs of
the user. Diagnosis would typically involve evaluation of a plurality of genes or products.
The genes will be selected based on correlations with important parameters in disease which
may be identified in historical or outcome data.




EXAMPLES 



Example 1: Gene Chip Analysis 
Molecular profiles of various normal and cancerous tissues were determined and
analyzed using gene chips. RNA was isolated and gene chip analysis was performed as
described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981-993).




TABLE 1A lists about 1119 genes up-regulated in ovarian cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 5.0. The "average" ovarian cancer level was set to
the 80th percentile value amongst various ovarian cancers. The "average" normal adult tissue level was set to the 85th percentile amongst various non-malignant tissues.
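The selection rule described for Table 1A can be illustrated with a short sketch. It is not the procedure actually used to build the table: the function names, the input layout (one list of expression values per probeset), the linear-interpolation percentile convention, and the handling of non-positive normal levels are all assumptions made for illustration. The normal-tissue percentile is left as a parameter and defaults to 85 on the reading that this is the value the text intends.

```python
from typing import Dict, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (assumed convention)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def select_upregulated(
    tumor: Dict[str, Sequence[float]],
    normal: Dict[str, Sequence[float]],
    tumor_pct: float = 80.0,
    normal_pct: float = 85.0,
    min_ratio: float = 5.0,
) -> Dict[str, float]:
    """Return {probeset: ratio} for probesets meeting the ratio cutoff.

    The "average" tumor level is the tumor_pct percentile across tumor
    samples; the "average" normal level is the normal_pct percentile
    across normal samples. Results are ordered by descending ratio.
    """
    selected = {}
    for probeset, tumor_values in tumor.items():
        tumor_level = percentile(tumor_values, tumor_pct)
        normal_level = percentile(normal[probeset], normal_pct)
        if normal_level <= 0:
            # Assumption: skip probesets without a positive normal level,
            # since the ratio is undefined for them.
            continue
        ratio = tumor_level / normal_level
        if ratio >= min_ratio:
            selected[probeset] = ratio
    return dict(sorted(selected.items(), key=lambda item: item[1], reverse=True))


# Hypothetical example data: two probesets, six samples each.
example_tumor = {"p1": [10, 20, 30, 40, 50, 60], "p2": [1, 2, 3, 4, 5, 6]}
example_normal = {"p1": [2, 2, 2, 2, 2, 2], "p2": [2, 2, 2, 2, 2, 2]}
example_hits = select_upregulated(example_tumor, example_normal)
```

Using a high percentile rather than a mean on both sides keeps a gene that is strongly expressed in only a subset of tumors, while still requiring that it exceed the level seen in most normal tissues.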

TABLE 1A: ABOUT 1119 UP-REGULATED OVARIAN CANCER GENES
Pkey: Primekey
Ex.Accn: Exemplar Accession
UGID: UniGene ID
Title: UniGene title
ratio: ratio tumor vs normal tissues



Pkey    Ex.Accn    UGID       Title                                                          ratio
423634  AW959908   Hs.1690    heparin-binding growth factor binding protein                  65.7
423017  AW178751   Hs.227948  "serine (or cysteine) proteinase inhibitor, clade B (ovalbumi  63.6
432938  T27013     Hs.3132    steroidogenic acute regulatory protein                         58.3
445810  AW265700   Hs.155660  ESTs                                                           35.9
431933  AA938471   Hs.115242  developmentally regulated GTP-binding protein 1                32.0
407112  AA070801   Hs.51615   "ESTs, Weakly similar to ALU7_HUMAN ALU SUBFAM                 31.3
425650  NM_001944  Hs.1925    desmoglein 3 (pemphigus vulgaris antigen)                      30.0


402075 
400301 


X0363S 


Hs.1667 


predicted exon 
estrogen receptor 1 


7S.i 



414540 BE379060

413994 AA296520 
401576 

457024 M397546 

440634 AI253123 

459006 AW298B31 
4009B4 
402421 

437329 M811977 

414605 BE390440 

411004 AWai3242 



469390 BE385725 

421823 N40850 

417366 BE185289 

422525 AA758797 

458121 S42416 

43C520 NM_C16190 

450192 AA263143 






404097 
4S937S 
443198 
441557 
433871 
429163 
443406 
400613 
448372 
410929 
445887 
422036 
404767 
420831 
405196 
452947 
429538 



"g^bSl236655F1 NIH_MGC_44 Homo sapiens cDNAdon 
selectin E (endothelial adhesion molecule 1) 

ESTs 

"ESTs, Highly similarto NEST_HUIMAN NESTI [H.sapien 
liypolhetioal protein FLJ20353 
predbted exon 



Hs.291761 ESTs 



Hs.263320 ESTs 
Hs.106234 

Hs.121121 

Hs.28625 



■gb:601283S01F1 NlhLM6C_44 Homo sapiens cDNAdon 
■gb:IVIR3^T0191-020200-207-g10 ST0191 Homo sapiens 
predicted exon 



Hs.192807 
Hs.74647 
Hs,242057 
Hs.24596 



45.17882 ESTs 
Hs.128577 ESTs 
Hs.1 17929 ESTs 

Hs,123571 

! 

Hs,167C47 



ESTs 

■gb:6C1276347F1 NIH MGC 20 Homo sapiens cDNAclon 
ESTs 

small praline-rich protein IB (corailin) 
ESTs 

Human T-cell receptor active alpha-chain mRNAfram JM c 
ctiromosorre 1 open readhg frame 10 
RAD51 -interacting protein 



AI039813 

AW4S2647 

W02410 

AA8B4766 

AI0S6238 

AW445166 
H47233 
A1263105 
AA3C2647 



Hs.143316 ESTs 



Hs.30643 

Hs.145597 
Hs.271891 



gbaraZOalOsI SoaiesJ<FLT_GBC_S1 HomosapienscD 



AA280824 Ha.190035 ESTs 



:(_CGAP_Gas4 Homo sapiens cDNAcIo 






442653 
443534 
458012 
441018 



AI076123 
AI424899 
AI8Q9587 
BE391563 
R45154 
AW898161 
BE551408 
T92641 



Hs.1 88211 
Hs.148782 
Hs.165433 
Hs.106604 
Hs.53112 
Hs.127196 
Hs.127648 

AJ403108 Hs.132127 



404664 



Hs.139322 small proline-ricli protein 3 

■gb:RC1-CT0279-070100-021-a06 CT0279 Homo sapiens c 
4s. 1 70226 Homo sapiens clone 23S79 mRNA sequence 

gb;oy92e04j(1 Soaies_letaUiver_spleea.1NFLS_S1 Homo 



ESTs 
ESTs 

"ESTs, Highly similar to T17342 hypolhetcd pioyn DKFZ 
ESTs 

■ESTs, WeaHy sWar to ALUa_HUMAN ALU SUBFAM 
ESTs 

hypothetical protein PR02176 



423735 AA330269 

444266 AI424984 

443341 AW631480 

457336 AW969657 

440500 AA972165 

446292 AF081497 

438086 AA336519 

434715 BE005346 

4C9387 AW384900 

409272 AB014569 

464913 AW841462 

439846 T63969 

409695 AA296961 



Hs.125455 ESTs 



"gb:EST33963 Embryo, 12weekll Homo sapiens CDNA5' 



Hs.301167 
Hs.116410 
Hs.123526 



Hs.4290 
Hs.269414 



.;iAA0669gene product 

"gb:RC6-CN001 4-080300-012-B09 CN0014 Homo sapiens 
"Homo sapiens cDNA: FU23537 Ss. clone LNG07690" 
'gb:EST1 12514 Adrenal gland tumor Homo sapiens cDNA 



■gb:PT2.1_6_G03.r tumor2 Homo sapiens cDNA3', mRNA 
ESTs 

407463
446767 
433040 



423637 
448643 
456714 
458356 
431822 
454822 
453358 
435542 
421286 
452799 
444355 



AI638743 
AW857271 
AJ272a34 
A1380107 



AI199576 
AA394104 
AL137279 
AW897741 
AW397265 
AI024855 
AA516049 
AWB33793 



"gb;QV1-DT0072-31010(M)56*(J7 DT0072 Homo sapiens 

Hs.224672 ESTs 

•gb:CM0-CT0307-21 01 00-158^09 CT0307 Homosapiens 
gbiHcmo sapiens mRNA Ibr putative capaoltatlve calcium c 

Hs.158954 ESTs 

Hs.300511 ESTs 

Hs,1876g8 'ESTs, hiiglily similar to cytomegalovlnis parOal fusion rece 
Hs.21«33 ESTs 

predicted exon 
Hs.1 53070 ESTs 
Hs.97489 ESTs 

Hs.130187 Homo sapiens mRNA; cDNA DKFZp43401214 (from cion 
Hs.21380 Home sapiens mRNA; cONA DKFZp5B6P1124 (from don 
■gb:CMO-NN0057-150400-335^4 NN0057 Homo sapiens 



AA687376 Hs.269533 ESTs 



428719 
418282 
437308 
400584 



435243 
431725 
425108 
422330 
432949 



440571 
411178 
445934 



AA358193 
AA215535 
AA749417 



AA570749 
AA191719 
AA843387 
AA650114 
AA904461 



Hs.292353 ESTs 

predicted exon 

Hs.164059 "Homo sapiens cM^ FU13338 Ss.done OVARC100188 
Hs.171066 BTs 

predicted exon 

Hs.114762 
Hs.261373 
Hs.2839 
Hs.96967 
Hs.1 15253 



464666 
443613 
400385 
411725 
455174 
412402 
434206 
450496 



453846 
404849 
442824 



ESTs 
epiregulln 

DEAD/H (Asp-Glu-AIa-Asp/Hls) box polypeptide 8 (RNA 
ESTs 

'gb:ns92h09.s1 NCI_CGAPJ« Homo sapiens cDNAdon 
ESTs 

■gb:RC2-ST0301-12020(W)11-f12 ST0301 Homo sapiens c 

iiypotiietical protein 
. Human DNA sequence from clone RP5-1028D15 on ohrom 

predicted exon 

■gb;EST39787 Epididymus Homo sapiens cDNA 5 end, mR 



458154 AW816379 



427276 
454315 
450786 
402578 



AW772282 
AA890521 
AW807073 



Hs.283104 

Hs.147801 

Hs.283032 
Hs.257131 
Hs.269128 



"gb:QV4-ST0234-1811994)35-g01 ST0234 Homo sapiens c 
ESTs 

igb:EST94257 Activated T-oells 1 Homo sapiens oDNA S e 
'^a/1-CT0364-260100-052-g05 CT0364 Homo sapiens 
dendritic cell protein 

i nwiear pore complex Interacting protein 

gteDK^p5MA1169_r1 564 (synonym: hfbr2) Homo sapie 
•gb:lin71b06.x1 NCI_CGAP_Kld11 Homo sapiens cDNAc 



•gb:li<R4-ST0062-03119M18-d06 ST0062 Ho 
gb:oz39b09.s1 Soares_NliHIMPu_S1 Homo sapiens cDNAc 
puldive capacHative calcium channel 
■gb:CIM0-CT0341-181299-130^«6 CT0341 Homo sapiens 



BE1 78065 
AA430058 
AA649530 
BE062109 Hs.241551 



"gb:601 29787 IF 1 NIH_MGC_19 Homo sapiens cI»lA don 

acetyl LDL receptor; SREC 

gb;DKFZp761F0816_r1 761 (synonym: hamy21 Homo sapi 

predicted exon 



447724 AW298375 

457028 AVM49838 

429900 AA460421 

4S2240 AI591147 

458067 AA393603 



Hs.24477 
Hs.97562 
HS.30B75 
Hs.61232 
Hs^752 



■gb:EST52440 Greater omenta tumor Homo sapiens d3N 
446745 AW118189
453060 AW294092 
443482 AWiaa093 
436843 AA824588 
416320 H478e7 
435772 AA70CC19 
451542 AAC18365 
408522 A1541214 
414712 
411940 
408733 
452030 
458175 
400612 
440159 AI537599 
429443 AB028967 
416319 AI815601 



Hs.1 56400 
Hs.21694 
Hs.250385 

Hs.34024 
Hs.132992 
Hs.32713 
Hs.46320 
Hs.77039 

Hs.254290 

Hs!l50434 

Hs.1 261 27 
Hs.202687 
Hs.79197 



■gb:oc83d02.s1 NCI_CGAP_GCB1 Homo sapiens cDNA c 
ESTs 

•ATP-binding cassette, subfamily G (WHtTE), member 5 ( 

■Small prdlne-rich protein SPRK [human, odonbgenic kera 
libosomal protein S3A 

"gb:CM4-PTO031-180200-507-e05 PT0031 Homo sapiens c 
ESTs 

Homo sapiens mRNA: cONA DKFZpS64N2464 (ftom cion 

ESTs . 
prediofed exon 



432408 
451702 
418179 



419276 
407237 
403065 
414195 



Hs.288677 
Hs.169054 ESTs 



440834 
437096 
400135 
447849 
400593 
427469 
402794 
452743 






AI45728S 
BE018611 
AA437010 
AA907027 
AA744406 



Hs.143979 
Hs251946 
HS2S65B4 
HS.12B606 



'myxovlms (Influenza) resistance 1, lie 
ESTs 
Wilms tumor 1 

gb:yt03f1 1 .r1 Scares retina N2b5HR Homo sapiens cDNA 

predioled exon 

"Homo sapiens cDNA: FU23161 fis, done LNG09730' 
■ESTs, Weakly similar to ALU4JIUMAN ALU SUBFAM 

"g^?»i?44M1F2 NIH_MGC_19 Homo sapiens cDNA don 
'ESTs, WeaMy similar to KIAA1276 protein [Ksapiensr 
■ sc»IA:FU23107fls, clone LNG07738" 



•gbinySlliOZsl NCLCGAPJY18 Homo sapiens cDNA do 



AI638147 Hs.164277 ESTs 



Hs.269347 ESTs 

predk^ted exon 

Hs.61455 ESTs 

AI611654 Hs,224908 ESTs 

AF242524 Hs,26323 hypolhetkalnudear factor SBBI22 

AA442153 Hs,104744 "ESTs, Weakly similar to AF208855 1 BM-01 3 [Hsapiensl 

AWSieaOO •gb;RC1-ST0278-160200-014^i10ST0278 Homosapiensc 

AK0ai745 Hs.184628 liypollietkal piatein FLJ10883 

AL390179 Hs.137011 Homo sapiens mRNA; cDNADKFZp547P1 34 (torn clone 

BE407727 ■gb:601299771F1 NiH-MGC_21 Homo sapiens cDNA cion 

AW290999 Hs. 146534 chromosome 21 open reading ftame 23 

AI904646 "gb:QV-BT065-020399-103 BT065 Homo sapiens oDNA, m 

AV6585ai Hs.282633 ESTs 

AA383623 Hs.293616 ESTs 

AI217900 Hs.144464 ESTs 



BE141737 H6.264105 
D61523 Hs.283435 
NIUI 006825 Hs.74368 



e1.(aW 



3in(63kD),en 



nicretkxiium/Go 



416203 H27794 

426882 AA393108 

454874 AW836407 

406702 Z20656 



405636 
437242 
425627 
452226 



457940 A13601S9 



440576 
419088 
454707 
446252 



•gbcPli42-HT0225fl31299-003-f11 Hr0225 Homo sapiens 
Hs.136907 ESTs 

predicted exon 
Hs.269065 ESTs 
Hs.97365 ESTs 

•gb:PM3iT0031-301299^-b09 LT0031 Honx) sapiens 
Hs.278432 ■myosin, heavy polypepttde 6, cardiac muscle, alpha (caidio 

Hs.1 03538 ESTs 
Hs.146884 ESTs 

gb:yu28a10£l Scares fetal liver spleen INFLSHomosaple 
Hs.212184 ESTs 

predicted exon 
Hs.187942 ESTs 
Hs.297007 ESTs 
Hs.296002 ESTs 
Hs.81796 ESTs 
Hs.126647 ESTs 
Hs.178045 ESTs 

■gb:od71 a09.s 1 NCI_CGAP_0v2 Homo sapiens cDNA do 
Hs.30445 Homo sapiens mRNA M length insert cDNA ckrne EURO 
Hs.134002 ESTs. 
Hs.126008 ESTs 

Hs.77496 

■gb:MR1-ST0206-170400^)2+fl05 ST0206 Homo sapiens 
Hs.150009 ESTs 

■gb:np85d02.s1 NO.CGAP.Thyl Homo sapiens cDNAd 

403093
454633 
407291 
455203 



■gb:lL3-ST01 43-290999^)1 9-D05 STO143 Homo sapiens c 
gbze4Sb01.r1 Soares reb-naN2b4HR Homo sapiens cONA 
■gb:PM4-SN002(M)10400-008-b09 SN002C Homo sapiens 

predicted exon 

hypoWcal protein FU20813 

■gb;QV4-STa21M91 199fl23-f10 ST0212 Homo sapiens c 



422949 
402112 
457886 

452332 ' 
434950 
409601 
419968 






AK001581 
M428240 
AI656040 
AW295151 
AA213403 
AI828600 
AI018777 



415715 
405277 
412167 
442771 
404898 
401230 
400623 
413308 



Hs.2186 
Hs.293346 
Hs.130794 
Hs.101657 

Hs.80828 
Hs.93913 
Hs.80961 
Hs.126083 
Hs.196532 
Hs.1 63612 
Hs.257542 
Hs.21124 
Hs.131241 
Hs.192605 
Hs.22607 
Hs.1 58451 
Hs.254569 
Hs.144339 



AI321836 
AI683487 
AA885871 
AI954795 



417728 
454S97 
427093 
408000 
440556 
400163 
420120 
417549 
406163 
437918 
449419 



AA398118 

L11690 

AW206958 



AW979294 
AJ243662 
AA018297 
N91553 



40B751 
401862 

417344 AW997313 

454455 AW752710 

455592 BE008002 

417650 T0587C 

456309 AA225423 

432030 AI908400 

421492 BE176990 
402576 

426874 N67326 
403334 

408562 AI436323 

439443 AF086261 



432527 
403273 
452077 



-gb:EST386997 MAGE resequences, UAGN Homo sapien 
kerabn 1 (epidermolytlc hyperkeratosis) 
-interleukin6(lnlerteron.beta2)' 
'polymerase (DNA directed), gammsT 



'ESTs, Weakly similar to ALU8JtUMAN ALU SUBFAM 



Hs.1 0359 ESTs 
Hs.299112 
Hs.135727 
Hs.1 561 35 



no sapiens cDNA FU11441 lis, ctone HEMBA100132 



AF202961 Hs.284200 



Hs.24790 

Hs.97579 
Hs.620 
Hs.1 25968 

Hs.95243 



Hs.121629 
Hs.119172 
Hs.202639 



'gb:EST3881 14 MAGE resequences, MAGN Homo sapien 
KIAA1573piX)teln 

'gb:MR4^124-26109941S^01 ST0124 Homo sapiens 
ESTs 

bullous pempliigoid anlgen 1 (230/240kD) 



insulin-like growth Ixlor 1 (somatomedia Q 
Hs.222056 "Homo sapiens cDNA FU11572lis, ckire HEMBA100337 
■gb:MR4-ST01 1 7-C7O10O-O27-a04 ST0117 Homo sapiens c 

Hs.42746 ESTS 
Hs.35634 ESTs 
Hs.1 10196 NICE-1 protein 
Hs.35493 ESTs 



■gb-.RC2-BN0048-250400-018-f12 BN0048 Homo sapiens 
■gb:IL3-CT0219-281099-024-A03 CT021 9 Homo sapiens c 
■gb:QV0-BNO147-2g04OO-214-h04 BN0147 Homo sapiens 



Hs.100640 ESTs 



•gb:nc24a12.r1 NCLCGAP_Pr1 Homo sapiens cDNActon 



Hs.31141 
Hs.1 27892 
Hs.15035 

Hs.1 02754 



AI288830 Hs.1 49924 ESTs 



•gb:RC2-HT0187-041099^11-d12HT0187 Homo sapiens 



WO 02/102235



PCT/US02/19297 



434066 
429643 
432340 
446142 
417412 
416913 
451318 
406647 
423843 
454145 
401200 
404166 
412761 
412333 
455092 
419281 
446171 



449737 
401830 
421991 
416996 



417682 
424983 
434353 



413122 
419929 
400925 



AF1 15649 Hs.283944 
AA455889 Hs.1 87648 ESTs 



gb:iij21d02,s1 NC1_CGAP_M1 Hoinosai 



AA029888 HS.95C71 ESTs 



ESTs 



predicted exon 
predicted exon 

AW995092 ■gb:QVO-BN0041-03030D-145-a1 0 BNCC41 Homo sapiens 

AW937486 •gb:QV3-DT004+-221299*lfrt09DT0044Homosapiens 
BE152428 'gb:CM0-HT0323-151299-126^04 HT0323 Homo sapiens 

H96452 Hs.42189 ESTs 

AI374927 gb:la66c04.x1 SoaresL.totaUetus_NI)2HF8_9w Homo saple 

AL359551 Hs.16493, hypothetical protein □KFZp762N231 6 

predicted exon 
AV663838 Hs^95131 ESTs 
Hs.233343 ESTs 
Hs.210689 ESTs 
Hs.246316 ESTs 

Hs.5038 neuroDathv taraet esterase 
Hs.110488 



AJ004832 
NM.014918 
W91892 
AI540644 
D55644 



AI742434 
AA630863 
ALC36710 



Hs.59609 ESTs 



Hs.169911 
Hs.131375 
Hs.209527 



■ESTs, Moderately similar to ALU7_HUMAN ALU SUBF 
gb:Human spieen PABL(pseiidoautosomal boundary-iiltese 

predicted exon 

gb:zd47a08.r1 SoaresJetal_heart_NbHH19W Homo sapien 
ESTs 

■ESTs, Weakly similar to ALUB_HUMAN !!!! ALU CIAS 
ESTs 

■gb;QVO-HT0368-04O100^)82-f06 HT0368 Homo sapiens 



AW516196 HS.25B238 'ESTs, 
R42778 Hs.22217 ESTs 
U90268 Hs.93810 cerebrs 



loALU1_HUl«ANALUSUBF 



AAB68729 Hs.144694 ESTs 
AW161697 Hs^94150 ESTs 
AA299780 Hs.121036 ESTs 



449867 
422174 
413382 
456502 



NM_014400 Hs.11950 GPI-anchored metastasis-associated protein homolog

AI206173 Hs.211375 ESTs 

AI672379 Hs.73919 "clathrin, light polypeptide (Lcb)"

AL049325 Hs.112493 Homo sapiens mRNA; cDNA DKFZp564D036 (from clone 

BE090689 "gb:RC1-BT0720-280300-011-ffl8BT0720Honnosapiensc 

AI798611 Hs.157277 ESTs 

predicted exon 



AI247716 






437724 
452755 
401781 



421469 
448251 
429125 



414266 
407839 
456101 



Hs.232168 ESTs 

AW970636 Hs.106413 ESTs 

AW444828 Hs.1 84323 ESTs 

AW138937 Hs.213436 ESTs 



AA446854 
BE077129 
AW57B713 
AA782163 
N84636 



AA370141 
BE267834 
AA045144 
AA159478 
BE147225 
CI 8200 



Hs.271004 ESTs 



Hs.293502 ESTs 



HS.15554S 
Hs.147293 
Hs.251453 

Hs.161566 ESTs 



■Homo sapiens cDNA FU1 1979 lis. done HEMBB100128 



flb:zo74d06.s1 Stratagene pancreas (937208) Hctno sapiens 
'gb:PM2-HT022543129g-003-f1 1 HT0225 Homo sapiens 
^:C18200 Human placenta cDNA (TFifiwara) Homo sapie 
AW369351 Hs.287955 "Homo sapiens cID(^AFU13090fis, clone l^2RP3002142 
T26661 •gb:AB65C7R Infant brain, LLNL array of Dr. M. Soares 1 

AW206737 Hs.253582 ESTs 



413025 AAB0526S Hs.291646 ESTs 









431673 AW971302 

421029 AW057782 

408391 AW859276 

422529 AW015128 

454389 AW752571 

427821 AA470158 

434657 M641876 

445628 AI344166 

424872 AA347923 

439232 N48590 

441417 AI733297 



413306 AW303544 

446726 AW300144 

427504 AA776743 
405621 

414127 AI431863 

409866 AW502152 

446232 AI281848 
403568 

451458 AI797558 

439157 AA912737 
401793 

429839 AI190291 

445672 AI907438 

449444 AW813436 

447499 AW2625B0 

421773 W69233 

439706 AWB72527 

432189 AA527941 



Hs.182979 
Hs.293233 



predicted exon 

•gb:IL5-UU0057.24030(W)50^01 UM0067 Homo sapiens 
ribosomal protein L12

"gb:af69g08/1 Soares_NliHMPu_S1 Homo sapiens cDNA 



•gb:MR1-CT()352-240200-105^)02 CT0352 Homo sapiens 



Hs.256703 ESTs 



Hs.98202 
Hs.191840 
Hs.166743

Hs.46693
Hs.144474
Hs.62905
Hs.2661
Hs.118654



'gb:IIJ-CT0213-17010a055-F02 CT0213 Homosapiens c 



"gl):EST54302 1^1 lieari II Homo sapiens cDNA S' end. m 



Hs.191589
Hs.135270



"Homo sapiens cDNA FLJ11629 fis, clone HEMBA100424



423193 
415548 
443236 
436053 
437191 
451829 
443151 
452055 
445265 
401032 
448134 
414803 
418540 
410449 



gb:UI-HF-BR0p-air-f-1 1-04Jl.r1 NIH_MGC_52 Homo sap 



Hs.282862 ESTs 

Hs.23590 "solute carrier family 16 (monocarboxylic acid transporters)
Hs.147674 KIAA1621 protein
Hs.112457 ESTs
Hs.59761 ESTs

"gb:nh30c04.s1 NCI_CGAP_Pi3 Homo sapiens cDNA clon

predicted exon 
Hs.211608 nucleoporin 153kD

Hs.254837 "Homo sapiens cDNA FLJ13502 fis, clone PLACE1004836
gb:yr47f06.r1 Soares fetal liver spleen 1NFLS Homo sapien
Hs.134169 ESTs
Hs.15443 



H62953 
AI079496 
AI057224 

NM_006846 Hs.5476 'sernei 
AW964031 ■■ 
AI827193 
AI377431 
AI218295 



Hs.247377 
Hs.132714 ESTs
Hs.293772 ESTs 



r,Kazaltype,5" 



AI821597 
AW748954 
AA688048 
AI904723 



427025 
444683 
447700 
412740 
416642 



AA425234 

BE175582 

AA393041 

1^0233 

AA397589 

AI375101 

AI4201B3 



AIC38316 
BE541042 
H55014 



Hs.109697 ESTs

gb:ye42e02.r1 Soares fetal liver spleen 1NFLS Homo sapien
Hs.90877 "ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAM
Hs.18192 Ser/Arg-related nuclear matrix protein (plenty of prolines 1

Hs.294080 ESTs 

"gb:CM-BT066-120299-092 BT066 Homo sapiens cDNA, 
gb:yy1 Id 2.r1 Soares melanocyte 2NbHM Homo sapiens cD

Hs.274449 "Homo sapiens cDNA FLJ20137 fis, clone COL07137"
predicted exon

Hs.79886 ribose 5-phosphate isomerase A (ribose 5-phosphate epimer
'gb:RC5-HT0S80-100S00422-C01 HTD580 Homo sapiens 

Hs.216493 ESTs 

gb:yg18ti1 1.r1 Soares Infant brain 1NIB Homo sapiens cDN 

Hs!l5B721 ■ESTs.WeaHysimilartoAUJLHUMANAUJSUBFAM 
Hs.171077 'ESTs, Weakly similar to similar to serine/threonine kinase 

'gb:RC1-BN0035-13040l»)13-a05 BN0035 Homosapiens 
Hs.226313 "ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAM
Hs.237305 ESTs 

gb:NKTBCae04f07r1 Normal Human Trabecular Bone Cell

'gb:Homo sapiens MIST mRNA, partial cds.'
Hs.274419 hypothetical protein FLJ10244
Hs.210589 ESTs 

gb:ox48c08.x1 Soares_total_fetus_Nb2HF8_9w Homo sapi
Hs.23240 "Homo sapiens cDNA FLJ13496 fis, clone PLACE1004471

gb:yu66f10.r1 Weizmann Olfactory Epithelium Homo sapie
Hs.147905 ESTs
Hs.188478 ESTs 
Hs.15049 ESTs 

•gb:CM1-CT0129-180B99«)6-b08 CT0129 Homo sapiens 



407021 
449276 
418251 
420788 



U52077 
AW241510 
AAB32123 
AA937957 



'gb:Human mariner1 transposase gene, complete coi
Hs.252713 ESTs 
Hs.177723 ESTs 
Hs.193367 ESTs 






401881 predicted exon

466436 AA251079 Hs.158386 ESTs

413425 F20956 "gb:HSPD05390 HM3 Homo sapiens cDNA clone 032-X4-

448966 AW372914 Hs.287452 "Homo sapiens cDNA FLJ11875 fis, clone HEMBA100707

429340 N35938 Hs.199429 Homo sapiens mRNA; cDNA DKFZp434M2216 (from clon



431009 

426662 
408636 



W03667 

NM_003613 

AW4446C2 

AF058939 

AI797473 

AI371849 

AL079741 



407506 

413802 
440061 
446283 



Hs.48213 'gap junction protein, beta 6 (connexin 30)'
Hs.122710 ESTs
Hs.135188 ESTs

"gb:MRO-BT0551-060300-102-e05 BT0551 Homo sapiens 



420757 
432479 
449733 
437846 
464934 
421929 
401780 
448106 
448836 
400842 



AA461797 
AWB47647 
AK002042 
U71600 



Hs.209468 ESTs

Hs.200696 "ATPase, Class VI, type 11C

Hs.183114 "Homo sapiens cDNA FLJ14236 fis, clone NT2RP4000515

Hs.269307 ESTs

Hs.99915 androgen receptor (dihydrotestosterone receptor; testicular
Hs.275675 katanin p80 (WD40-containing) subunit B 1
Hs.29438 'Homo sapiens cDNA FLJ12094 fis, clone HEMBB100260
Hs.244569 ESTs 

■gb:Mf»CT0176-081099^2J)09 CT0176 Homo sapiens 
Hs.247360 ESTs 

predicted exon 
Hs.171941 ESTs ' 

Hs.11081 "ESTs, Weakly similar to S57447 HPBRII-7 protein [H.sap
predicted exon

Hs.201202 'ESTs, Moderately similar to Pro-Pol-dUTPase polyprotein
'gb:ltJ-CT0213-280100«6-A06 CT0213 Homo sapiens c

Hs.134795 "Homo sapiens cDNA FLJ11180 fis, clone PLACE1007452
'gb:Human zinc finger protein zfp31 (zf31) mRNA, partial

Hs.32241 ESTs 

"gb:601346293F1 NIH_MGC_8 Homo sapiens cDNA clone

Hs.171073 ESTs

Hs.135159 'Homo sapiens cDNA FLJ11481 fis, clone HEMBA100180



404992 
428129 
414315 
400491 
459275 
450853 
467460 
434168 



401050 
447588 
449002 
452759 
443220 
400749 
406277 
433785 
434129 
453369 
411722 



Hs.26912 ESTs 

'gb:HSB65D052 STRATAGENE Human skeletal muscle cD
Hs.50868 "solute carrier family 22 (organic cation transporter), memb
Hs.118321 ESTs 



AA479629 Hs.44243 ESTs 

AI143312 Hs.164004 ESTs 

AI204626 Hs.116156 ESTs

AI214671 •gb:qm32d02.x1 N( 

AI912012 Hs.200737 ESTs 

D81402 gb:HUM162A03B Human fetal brain (TFujiwara) Homo sa

AW195566 Hs.253182 ESTs 

H16302 Hs.168950 Homo sapiens mRNA; cDNADKFZpSSSAi 046 (iomclon 



AI394154 
Ai620018 
AW590773 



iP_Lu5 Homo sapiens cDNA do 



429413 
407788 
447252 



41885B 
419323 
415317 
418654 



Hs.27g65g 
Hs.1 17461 
Hs.258996 
Hs.132032 



"ESTs, Weakly similar to unknown protein [H.sapiens]"



'Homo sapiens cDNA FLJ11683 fis, clone HEMBA100490
predicted exon 
predicted exon 



AW875942 

AW858621 

AA115466 

N40340 

AA287203 

AA634489 

AW1 03046 

AW298119 

AL12CC71 

AW978465 

NM_014058 
BB14982 

Rsogie 

BE146879 

AF086332 

AW961605 

AI092379 

Z43388 

AA226334 

AF067801 



ESTs

■gb-.CM1-PT0O13-131299^)67-b1O PT0013 Homosapiens 
■gb-.CMO-CT0342-021299-115-f04CT0342 Homo sapiens 
gb:zk89b05.r1 Soares_pregnant_uterus_NbHPU Homo sapi
"ESTs, Weakly similar to ORF2 [M.musculus]"

SMA5

gb:nf76g11.s1 NCI_CGAP_Co3 Homo sapiens cDNA clone
KIAA0771 protein 



Hs.292368 ESTs 



Hs.58314 
Hs.21145 
Hs.135275 
Hs.5570 
Hs.154291 



"Homo sapiens HSPC082 mRNA, partial cds" 
DESC1 protein 

S100 calcium-binding protein A2
gb:yn01e10.r1 Soares adult brain N2b4HB55Y Homo sapien
'gb:QV4-HT0222-261099^)14x11 HT0222 Homo sapiens 



ESTs 

'Homo sapiens cDNA: FLJ22489 fis, cl



hypothetical protein FLJ10006



eHRC10951' 



:gb:Homo sapierB HD0GC21 P mRNA, complete cds.' 

451191
454006 
443657 
455879 



423372 
450316 
447795 



411483 
420271 
431948 



M843915 

N67900 

U12775 

BE153275 

BE242152 

AL040021 

AW975076 

AI246375 

Wa4446 

AW295151 

BE074910 

AW848115 
AI954365 
AA917706 



Hs.54707 
Hs.118446
Hs.37006 






431277 
446232 
459170 
437876 
406762 
401245 
446102 
446989 
421160 
458831 
408914 
411018 
436662 
457620 
438647 
439670 
419273 
443745 
431029 



417136 
416441 
413702 



426872 
437152 
440517 
450877 
410664 
405793 
418709 



437939 
451842 
405310 
443747 
427287 
413521 



Hs.288417 proteins 



agouti (mouse)-signaling protein 
gb:yf42f10.s1 Soares fetal liver spleen 1NFLS Homo sapien
-gb:PMO+IT0335-18()400t008^11 HTI)33S Homo sapiens 



H$42892 
Hs.194616 1 
Hsi79724 1 

Hs.44970 
Hs.144269 
Hs.128062 
Hs^TBO 



•gb:RC5^T0580-170300-021-F12 BT0580 Homo sapiens 
predicted exon 

'gb:II^T0214^01299^C09 CT0214 Hcmo sapiens c 



gb:S4d10 Human retina cDNA randomly primed sublibrary



BE337771 
AW003578 
AA089474 

CI 3863 
AI027847 
AV653444 
AA501806 



AW168067 
AK001898 
AL080215 
• H71739 
AW4S03C9 
AW813428 
H71937 
AA602711 
AA813118 
T79926 
BE271130 



"ESTs, Highly similar to AF199597 1 A-type potassium cha
•gbS^SMIFI NIH_MGC_44 Homo sapiens cDNA don 



Hs.163443 

H3!280776 
Hs.249965 ESTs 



"Homo sapiens cDNA FLJ13684 fis, clone PLACE2000021



Hs.200227 ESTs 



'gb'.npOShOe^l Na_CGAP_Pi2 Homo sapiens cDNA don 



BE407197 
BE170313 
AI907552 



Hs.169756 

Hs.ie3230 
Hs.269165 
Hs,293490 
Hs.9728 
Hs.248571 
Hs.282284 

■gb:RC4-ST0173-1910994)32-a07 ST0173 Homo sapiens c 

Hs.50547 ESTs

"gb:601301552F1 NIH_MGC_21 Homo sapiens cDNA clon
■gb:QV4-HT0536O40500-193^02 HT0536 Homo sapiens 
"gb;RC-BT1 47-120499 044 BT1 47 Homo sapiens cDNA, m 

Hs.295306 'ESTs, Highly similar to unnamed protein product [H.sapien
predicted exon

Hs.234545 'ESTs, Weakly similar to AF156136 1 novel retinal pigmen

gb:hi82h11.x1 Soares_NFL_T_GBC_S1 Homo sapiens cDN
Hs.159388 Homo sapiens clone 23578 mRNA sequence
Hs.170938 'ESTs, Weakly similar to KIAA0705 protein [H.sapiens]'
Hs.293556 ESTs

Hs.112011 "ESTs, Weakly similar to unknown [H.sapiens]"

gb:Homo sapiens mRNA; cDNA DKFZp566C0324 (from c

Hs.132246 ESTs 

Hs.29178 ESTs 

Hs.65370 'lipase, endothelial'
predicted exon

gb:zr17ciar1 Stratagene NT2 neuronal precursor 937230 H
Hs.44784 ESTs 

■gb:RC1-NN0073-2604aa01 1-g09 NN0073 Homo sapiens 
predicted exon 

Hs.115521 "REV3 (yeast homolog)-like, catalytic subunit of DNA poly
■gb:CM0-BN0103-18030()-296j04 BN0103 Homo sapiens
predicted exon

Hs.136355 ESTs 

Hs.77646 Homo sapiens mRNA; cDNA DKFZp761MC223 (from clon 
Hs.141840 'ESTs, Weakly similar to S59501 interferon receptor JFNA 
Hs.267087 'ESTs, Moderately similar to ALU4_HUMAN ALU SUBF

predicted exon

"gb:AV646352 GLC Homo sapiens cDNA clone GLCAME
Hs.174188 KIAA0938 protein

•gb:MR0^T0208-101299-202-a04 HT0208 Homo sapiens 
Hs.99066 ESTs 

451488
466713 
452161 
428647 



401508 
412677 
441720 
418051 
438014 
432101 



431318 
413470 
402425 

400160 
413795 
405071 



446091 
444788 
404972 
400227 
433804 






H22999 Hs.208846 ESTs 

BE069891 "gb;QV4-BT0401-201299mb01 BT0401 Homo sai 

R43077 Hs.221747 ESTs 

AA830050 Hs.124344 ESTs 

AI246275 Hs.149196 ESTs 

AB011142 Hs.114293 KIAA0570 gene product

AW029608 Hs.17384 ESTs



AI346487 
AW192535 
N71183 
A1918950 
AW293133 
AA721522 
AA502700 



449547 H93543 



418342 

438007 
410536 

414083 
405362 
410102 
457868 



Hs.19479 ESTs 

Hs.121806 'Homo sapiens cDNA FLJ11971 fis, clone HEMBB100120
Hs.11092 "Homo sapiens cDNA FLJ14290 fis, clone PLACE1006795
Hs.101340 ESTs

"gb:nv54h12.r1 NCI_CGAP_Ew1 Homo sapiens cDNA clo
Hs.293147 ESTs 

gb:yx54c11.s1 Soares melanocyte 2NbHM Homo sapiens c 

predicted exon 

■gb:RC0-HT0613-14O30^O21-<i06 HTOSIS Homo sapiens 

predicted exon 
Hs.142003 ESTs 

predicted exon 

predicted exon 
Hs.207530 ESTs 

predicted exon 
Hs.200197 ESTs
Hs.202821 ESTs

predicted exon
Hs.112740 ESTs
Hs.7549 ESTs 

predicted exon 
Hs.151406 KIAA0623 gene product 
Hs.117963 ESTs

Hs.203967 metallothionein 1A (functional)
Hs.229535 CATX-15 protein 
Hs.244106 ESTs 

Hs.293504 "ESTs, Moderately similar to ALU1_HUMAN ALU SUBF
Hs.158875 ribosomal protein L14

gb:yv27d04.s1 Soares fetal liver spleen 1NFLS Homo sapie

Hs.170378 ESTs 
Hs.257786 ESTs 



AW248508 
AW975133 
AF0050B2 



401260 
406544 
428446 
412246 
400420 



433405 
416795 
435706 



AI024600 . 

AI160873 

AJ277247 

BE065387 

AB037749 

AW450630 

AW157566 

AI497778 

W31254 

AA057418 

AA398848 



403041 
408523 
416515 



452158 
411237 
400441 
439398 
440862 
4154S1 

456072 



H39048 
H19415 
AA031956 
H54381 



Hs.279727 "Homo sapiens cDNA FLJ14035 fis, clone HEMBA100463
'gb:EST387239 MAGE resequences, MAGI4 Homo sapien
'gb:Homo sapiens skin-specific protein (xp33) mRNA, part

Hs.134289 "ESTs, Weakly similar to KIAA1063 protein [H.sapiens]"

Hs.52515 transducin (beta)-like 2



Hs.98612 ESTs 

Hs.69233 'ESTs, Weakly similar to KIAA1064 protein [H.sapiens]'
Hs.287369 interleukin 22

Hs.186928 
Hs.133851 
Hs.156892 
Hs.168053
Hs.7045 
Hs.33654 
Hs.97541 
Hs.192183 
Hs.35381 
Hs.278909 



AW833259 

N91716 

BE173164 

Ai174947 

NM_003540 



Hs.194140 ESTs 



predicted exi 

"gb;RC2-TT0007-131099-Q11-c01 TT0007 Ho 



insulin-like growth factor-binding protein 4
■gb;ii*55f06^1 NCL0GAP_Ew1 Homo sapiens cDNA do 
Homo sapiens mRNA; cDNA DKFZp564D1164 (from clon
predicted exon 

'H4 histone family, member C 
■gb:RC-BT1634)20499^5 Bri68 Homo sapiens cDNA, m 
ESTs 



Hs.221504 
Hs.127432 
Hs.268720 



"ESTs, Moderately similar to ALU1_HUMAN ALU SUBF
gb:zk15e04.s1 Soares_pregnant_uterus_NbHPU Homo sapi
gb:yq89a03.s1 Soares fetal liver spleen 1NFLS Homo sapie

443488
430325 
454466 



AI073495 
A1734186 
AA984138 
AA2.7B2T! 
AJ403107 



AW296713 
R89238 



Hs.266467 ESTs 

Hs.133912 "ESTs, Weakly similar to methyl-CpG binding domain-cont

Hs.135105 ESTs

Hs.279895 'Homo sapiens mRNA for KIAA1578 protein, partial cds'
Hs.194212 ESTs

Hs.148590 "ESTs, Weakly similar to AF208846 1 BM-004 [H.sapiens]
"gb:EST386775 MAGE resequences, MAGM Homo sapien

Hs.146180

predicted exon

■gb:RC2-BN0120-25040(M)12-f03 BN0120 Homo sapiens 

Hs.1 79982 ■ ■ ■ " 

Hs.221441 ESTs 



Hs.34262 ESTs 



421375 
437630 
443500 

438214 
428046 
431941 



AI252782 
AV646388 
A1613276 



Hs.97206 
Hs.100595 
Hs.153029 
Hs.137071 



417421 
401129 
434745 



427166 

416168 
431467 
421558 
458055 
418345 
426544 
433544 
442007 
443422 
434311 



AF075079 
AW936136 
F08208 



AA452244 
AW238162 
AA586894 
AL1 38201 



AW974445 Hs.185155 



AA431576 
H23687 
N71831 
AB011125 
AW979121 



lar to ALU1_HUMAN ALU SUBF

'guanine nucleotide binding protein (G protein), beta polyp 
TRABID protein 

"ESTs, Moderately similar to 136022 hypothetical protein [H
"Homo sapiens cDNA FLJ20099 fis, clone COL04544"
predicted exon

gb:Homo sapiens full length insert cDNA YQ80A08

ESTs

paired mesoderm homeo box 1
"protease inhibitor 3, skin-derived (SKALP)"
predicted exon 

"gb:QV4-8T0534-281299-053<;05 BT0534 Homo sapiens 



S100 calcium-binding protein A7 (psoriasin 1)
"nuclear receptor subfamily 4, group A, member 1"

"ESTs, Weakly similar to HuEMAP [H.sapiens]"

'Homo sapiens mRNA for KIAA1561 protein, partial cds'



Hs.155658 ESTs 



■gb;RCO-ST0174-191099^1-a07 ST0174 Homo sapiens c 



:yn72(l12.r1 Soares ady' 
Homo sapiens mRNA; cDNA DKFZp434E0628 (from clon
KIAA0553 protein

"ESTs, Weakly similar to ALUB_HUMAN !!!! ALU CLAS
AJ001696 Hs.241407 "serine (or cysteine) proteinase inhibitor, clade B (ovalbumi
AA492325 gb:ng81b11,s1 Na_CGAPJ>:6 Homo sapiens cDNA clone 

AI793211 Hs.165372 "ESTs, Moderately similar to ALU1_HUMAN ALU SUBF
AA301116 Hs.142838 "Homo sapiens cDNA: FLJ23444 fis, clone HSI01343"



Hs.200938 ESTs 



'gb:RC1-BT0314-31030M15-l01 BT0314 Homo sapiens c 



414428 
43.1211 
411541 
448612 
419118 



454690 AW8S4639 



A1696363 Hs.171285 ESTs 



416292 
449309 
408418 
416100 
437845 
443345 
418407 
434557 
431688 



AA179233 

AW589823 

AW963897 

HI 8700 

AA769578 

AI052508 

AL044818 

AW855466 

AA513906 



H62943 
BE326688 
AW451457 
AW016437 



predicted exon 

'^b:MR1-CT02S8-1401 00-203^10 CT025B Homosapiens 
hypothetical protein FLJ10826



Hs.44743 KIAA1435 prot
Hs.268799 ESTs
Hs.90488 ESTs
Hs.164482
Hs.84928 
Hs.271666 



Hs.291911 
Hs.33565 
Hs.17166 
Hs.147482 
Hs.154188 



"ESTs, Weakly similar to contains similarity to TPR domain
438425
415532 
441442 



R14780 
AL043282 
AI792478 



HS.8328S ESTs 



405227 
442257 
403403 



419936 
437267 
430563 



AA771881 
M382267 
AB000509 



Hs.298149 
Hs.10653 
Hs.29736 



AW511443 
AA481269 
AI198994 
AJ132099 
AW977435 



to 16.7Kd protein [H.sapiens]"



ESTs 
ESTs 

TNF receptor-associated factor 5 
predicted exon 

gb-.UI-HF-BHO-alb-b-OS-O-UUI NIH_MGC_50 Homo sap 
predicted exon 

gb:7Ba3C12 Chromosome 7 Fetal Brain cDNA Library Ho
"ESTs, Weakly similar to NG22 [H.sapiens]"
"gb:ol91d05.y5 NCI_CGAP_Kid5 Homo sapiens cDNA clo
ESTs 



425557 
432162 
450152 
410053 
421285 



409545 BE298182 



441541 
400709 
407615 
424153 



453034 
412963 
425351 

416533 
458378 
401213 
405904 
445132 
405138 



AI348408 
AW342028 
AA93B663 



405867 
414093 
447306 
413083 
404828 
402643 
421988 
413404 
459043 
404410 
430264 
43U99 
412566 



AW136374 
AF283776 
AA481119 
RS09S6 
AA383623 
A1267491 
AAS79001 
AA00g647 
Y00630 
H01699 



ESTs 

Hs.59332 ESTs

Hs.1363 "cytochrome P450, subfamily XVII (steroid 17-alpha-hydro

Hs.20369 "ESTs, Weakly similar to gonadotropin inducible transcript

Hs.1200 arachidonate 12-lipoxygenase

Hs.120845 ESTs

Hs.129740 KIAA0565 gene product 



AW162907 Hs.76969 



proline-rich protein with nuclear targeting signal
predicted exon 

Hs!287329 FaSlIJjiSi '"'^""'^ 
predicted exon 

"gb:601177324F1 NIH_MGC_17 Homo sapiens cDNA clon
Hs.131793 ESTs
Hs.49265 ESTs

gb:420f09.s1 Soares_fetal_liver_spleen_1NFLS_S1 Homo
Hs.124675 "ESTs, Weakly similar to unnamed protein product [H.sapie
Hs.256112 ESTs 
Hs.199828 ESTs 



predicted exon

"gb:aa65f05j1 NCI_CGAP_GCB1 Homo sapiens cDNA cl
'Homo sapiens cDNA FLJ13415 fis, clone PLACE1001799
ribosomal protein L7a



Hs.184109
Hs.238809
Hs.155924



Hs.270949 
Hs.80285 
HS.283SS8 
HS.59S03 
Hs.293616 



CAMP responsive i 
predicted exon 
retinoblastoma-like 2 (p130)
ESTs

predicted exon
predicted exon 

gb:HSC29G031 normalized infant brain cDNA Homo sapie 

predicted exon 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586C1723 (from clon 

hypothetical protein PRO1555

■ESTs, Weakly similar to AF157318 1 AMI? protein [H.S 



Hs.192221 ESTs 

Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alph
Hs.75716 "serine (or cysteine) proteinase inhibitor, clade B (ovalbumi
Hs.27289 CGI-125 protein
predicted exon

'gb:601078872F1 NIH_MGC_12 Homo sapiens cDNA clon

Hs.170333 ESTs 

"gb:RC4-BT0311-250200-014-h06 BT0311 Homo sapiens 

predicted exon 
predicted exon 
Hs.161333 ESTs
Hs.297431 ESTs

Hs.208113 "ESTs, Weakly similar to N-WASP [H.sapiens]"
predicted exon

"gb:nc71f10.s1 NCI_CGAP_Pr1 Homo sapiens cDNA clon
NM_001514 Hs.258561 general transcription factor IIB
AW962574 "gb:EST374647 MAGE resequences, MAGG Homo sapien



AW450481 
BE503463 
AI306444 

AA470619
Hs.81



ESTs 



AA884304 Hs.131163 ESTs 

446205 AW172662 Hs.149479 ESTs

455275 AW977806 "gb:EST389810 MAGE resequences, MAGO Homo sapien

415679 AA165232 Hs.222069 ESTs

423200 AA323073 Hs.289083 ESTs

440052 AI633744 Hs.195648 ESTs

424717 H03754 Hs.152213 "wingless-type MMTV integration site family, member 5A"

420111 AA255652 gb:zs21h11.r1 NCI_CGAP_GCB1 Homo sapiens cDNA clo

Hs.2726a8 hypothetical protein FLJ20397

Hs.143056 ESTs 

Hs.136912 ESTs 



432140 
414904 
409479 
404727 



AI623778 
U46922 
AK001432 
NM_013g89 



fragile histidine triad gene
1 "Homo sapiens cDNA FLJ10570 fis, clone NT2RP2003117
'deiodinase, iodothyronine, type II'
gb:yw96g08.s1 Soares_placenta_8to9weeks_2NbHP8to9W



416879 
456088 
423175 
424585 
452281 
424323 
426701 
447645 
402974 



431867 
442768 
424085 



410123 
435435 
417071 



Hs.287332 ESTs



lA: FLJ23082 fis, clone LNG06451" 



AA338791 
AI968103 
AW897321 



AA523660 

AL048534 

NM 002914 

AF174394 

U62436 

AF226667 

T169B1 
T89473 
NSg820 
H50167 
AA604fi60 



gb:zx43h11.r1 Soares_total_fetus_Nb2HF8_9w Homo sapie
Hs.28792 'Homo sapiens cDNA FLJ11041 fis, clone PLACE1004405
Hs.146763 nascent-polypeptide-associated complex alpha polypeptide
Hs.269461 "Homo sapiens cDNA FLJ12836 fis, clone NT2RP2003206

Hs.159699 ESTs

predicted exon 
Hs.211061 ESTs 
Hs.98908 ESTs 

predicted exon
Hs.191727 ESTs

Hs.48458 "ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAM
Hs.139226 replication factor C (activator 1) 2 (40kD)
Hs.177461 'Homo sapiens apoptotic-related protein PCAR mRNA, par
Hs.103128 "cholinergic receptor, nicotinic, alpha polypeptide 5"
Hs.58553 CTP synthase II 



417616 
411805 
419000 
413483 
400975 
407453 
430757 
417793 
401877 
457122 
410706 
435807 
428398 
401088 
414501 
419083 
421107 



Hs.33113 ESTs 

gb:ab03a10.s1 Stratagene fetal retina 937202 Homo sapiens
■gb:PM1-Kr0345-121199<)01-c08 HT034S Homo sapiens
gb:nq38g05.s1 NCI_CGAP_Co10 Homo sapiens cDNA clo

Hs.25978 ESTs 

Hs.142390 ESTs 

Hs.211516 ESTs 



predicted exon 

AI674093 Hs.293961 ESTs 

BE080469 'gb:QV1-BT0630-28020(M)8M06 BTC630 Homosapiens 

AI816344 Hs.12554 'ESTs, Weakly similar to Nucleosome Assembly Protein 1-

AI908539 Hs.184592 KIAA0344 gene product

AA447990 Hs.190478 ESTs

AI531546 Hs.159732 ESTs

AW022054 Hs.135591 ESTs 

R07728 Hs.263668 ESTs 

AW864183 ■gb;PMO^N0014-260400^»2-d02 SN0014 Homo sapiens 

T79855 Hs.258592 ESTs 

BE144017 Hs.184693 "transcription elongation factor B (SIII), polypeptide 1 (15k



AI732404 
AI033299 
AI249368 



461686 
453867 
419985 
426650 
424115 



AW848346 
X14767 
AW514083 
M 016112 



H66373 

AA382B14 

AA335497 



HS.129B92 KIAA0522 protein 
Hs.33728 "ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAM

Hs.68846 ESTs 
Hs.113614 ESTs 
Hs.98558 ESTs 

predicted exon 
Hs.171984 ESTs 

Hs.98613 "Homo sapiens cDNA FLJ12292 fis, clone MAMMA10018
Hs.55606 'ESTs, Weakly similar to ZN91_HUMAN ZINC FINGER P
'gb:IL3-CT0214-150200^)76-F03 CT0214 Homo sapiens c
Hs.89763 "gamma-aminobutyric acid (GABA) A receptor, beta 1"
Hs.190135 ESTs

Hs.159241 polycystic kidney disease 2-like 1
Hs.110293 ESTs
HS.10819S HSPC037 protein

Hs.15973 "ESTs, Highly similar to bA393J16.3 [H.sapiens]"

'gb:EST96097 Testis I Homo sapiens cDNA 5' end, mRNA
409684
454423 
417173 



410523 
434623 
454484 
402131 
438913 
402628 
415973 
455640 
442750 
404638 
431117 
428819 

427336 
416450 
440876 



440559 
421236 
416258 
405982 



predicted exon

gb:zm89f12j1 Stratagene ovarian cancer (937219) Homo sa
Hs.179662 nucleosome assembly protein 1-like 1
Hs.81424 ubiquitin-like 1 (sentrin)
Hs.269001 ESTs

Hs.274227 "Homo sapiens cDNA FLJ10010 fis, clone HEMBA100030
Hs.179825 RAN binding protein 2-like 1
predicted exon

Hs.226155 "ESTs, Weakly similar to PSF_HUMAN PTB-ASSOCIATE
Hs.253146 ESTs

gb:yf26c09.r1 Soares fetal liver spleen 1NFLS Homo sapien
HS.797C8 ESTs 

"9btMR0-HT0164-151299^12-(J03 HT0164 Homosaplens 
HS.4C14 KIAA0946 protein; Huntingtin interacting protein H
H5.21S8S7 ring finger protein 14 



AI380429 HS.17244S ESTs 

predicted 
R24707 Hs.260201 ESTs 
BE064059 
AI016803 



AL1 35623 
AA83711B 
AA448542 
AA180467 



predicted exon 
Hs.260600 delta (Drosophila)-like 1
Hs.193914 KIAA0575 gene product
Hs.118366 ESTs
Hs.261677 GanI 



Hs.14: 



AI652236 
AW968363 
AW629054 
AI287622 
N45661 



ESTs 



449182 AW292381 



Hs.279570 ESTs 

■gb:601301836F1 NIH_MGCJ1 Honn sapiens cDNAdon 

gb:yv60c02.s1 Soares fetal liver spleen 1NFLS Homo sapie
Hs.49376 hypothetical protein FLJ20644

'gb:EST380439 MAGE resequences, MAGJ Homo sapiens
Hs.125976 "ESTs, Weakly similar to metalloprotease/disintegrin/cystei
Hs.161956 ESTs 
Hs.275131 ESTs 

predicted exon

predicted exon 
Hs.169142 ESTs 
Hs.119887 ESTs 
Hs.224150 ESTs 



430163 
455441 
400304 



AI808725 
AI689496 
AI015953 



BE253197 
AW291319 
AI498640 



441818 
458804 
411905 



Hs.147783 
Hs.108932 
Hs.125265 
Hs.234748 

Hs.113261

Hs.114318

Hs.194574
Hs.159354
Hs.124004



416425 
407173 
452502 
446657 
459124 



ESTs 
ESTs 

predicted exon 
predicted exon 

■gb:RC1-BT0605-06020(W]12-h12 BT0605 Homo sapiens 
gb:yc10d08.s1 Stratagene lung (937210) Homo sapiens cDN
'gb:PU-BTD46-220199-286 1 BT046 Homo sapiens cDNA 
■ESTs, Moderately similar to ALU7_HUMAN ALU SUBF 



BE408145 

AI807264 

AW451597 

AI091633 

AW877277 

AW190593 

W92142 

AI530451 

AL1 67625 

BE265067 

AA628151 

AW296756 

N51636 

AW188551 

AK000185 

AW594482 

AW812227 



Hs.205442
Hs.167409
Hs.136167



"gb-.MR4-PT0051-1 50200-001 -d03 PT0051 Homo sapiens 
ESTs 

"ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAM

1 761 (svnonvm: hamy2) Homo sapi 
;apienscDNA clone 

ESTs

"Homo sapiens cDNA: FLJ21432 fis, clone COL04219"
gb:yy87b01.s1 Soares_multiple_sclerosis_2NbHMSP Homo
Hs.99519 "Homo sapiens cDNA FLJ14007 fis, clone Y79AA1002407
"gb:Homo sapiens cDNA FLJ20178 fis, clone COL09990"
Hs.253315 ESTs 

:gb:RC2-Sr01 73-201099-01 1-g09 ST0173 Homo sapiens c
405800
429430 
439518 



"synuclein, alpha (non A4 component of amyloid precursor)

ESTs

Homo sapiens mRNA; cDNA DKFZp564M0616 (from clon
"gb:EST58717 Fetal lung II Homo sapiens cDNA 5' end, mR

194624 ESTs 
ESTs 

"gb:tq20g10.x1 NCLCGAP.UH Homo sapiens cONA don 

!87767 Sequence 8 from Pa 

159153 ESTs 



Hs.155335 ESTs 

gb:zd60d04.r1 Soares_fetal_heart_NbHH19W Homo sapien
Hs.248114 glial cell derived neurotrophic factor
HS.3C5C3 "Homo sapiens cDNA FLJ11344 fis, clone PLACE1010870
Hs.126373 Human DNA sequence from clone RP5-1184F4 on chromos



Hs.31835 ESTs 



gb:yf24c06.r1 Soares fetal liver spleen 1NFLS Homo sapien



TABLE 1B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number



AW753085AW763082AtlV054744AW753107AW753087 



409867 1156530.1 

409940 1160994.1 

409982 1165022.1 

410500 1206323.1 

410523 1207041.1 

410536 1207322.1 

410626 1212621 -1 

410966 1228071.1 

411004 1228975.1 

411018 1229132.1 

411057 1230493_1 

411178 1234752.1 

411237 1236377.1 

411483 1247172.1 

411489 1247360.1 

411541 1249044.1 

411660 1253078.1 

411693 1264206.1 

411722 1254914.1 

411725 1255047.1 



411748 
411805 
411905 
411940 
412167 
412333 
412402 



1256178.1 
1259273.1 
1265181.1 
1266262.1 
1280605.1 
1289037.1 
1292917.1 



• H85615 H86300 H86263 H86282AA059278 H86304 
AW818390 AW818237 AW85891 1 AW858977 BE072544 W26498 
BE296182AW629821 
AA076010 AA076009 AI094314 
AA295961 AA296889AA076946AA077528AA077497 
AW502152 H41202 H29772 



BE548143AW511659 
BE005839BE005619AWS16815 

R09442AW846115AW846108AW75ig67AW846083AW846087AW846090 
BE143839 AW752787 AW752796 BE1 43584 N71805 
N39533AW753094AW753093 
BE407727 

AW812C88AW812105AW8120B2 

AW813242BE14S089AW813195AW813173AW813206BE145g53BE146212AW813196AW854582AW813241BE061582 
AW813428AW813444AW813367AW813368AW813429AW813424 
AW8150g8 BE154843 BE1S4831 
AW820852 AW820773 AW821088 

AW833676 AW833814 AW833798 AW833677 AW833449 AW833630 AW833626 AW833444 AW833366 AWB33791 AW83365g AW833432 



AW848115AW848127AW887023AW887117 
AW848346 AW848760 AW848340 AW84881 8 AW849043 AW849061 
W03940T98335AW850705 
AW855718AW855740AW855748 
AW857271 AW857308AW857295AW857258 
AW875942 AW858234 AW875938 AW875941 , 
AW856396 AW858505 AW858476 AW861971 , 
AW858393 

AW859920BE079582AW997112 
AWe&4ie3 AWe&4iei AWe64135AW864198 



AWB58556 AW861908 AW85e514 AW858601 AW861909 AW858434 AW85e400 AW85e405 



AVV984788AW984816AVVg84811AVV984807AVV984819AVIfl847g0AWg84782AVVg84784AVV9847B0AVV984^ 

AW984789 AW984823 AW94a021 AW984802 AW984800 AW984799 AW9Ma25 AW984792 AW984821 AW984820 AW984808 AW984809 

AW984812 AW984801 AW984813 AW984778 AW984804 AW984798 AW948C17 AWg84827 



413425 136885.1 F20956 AA129374AA133740AW819878 

413470 1371600.1 N20934 BE141875 BE141877 

413521 1374612.1 BE145814BE 

413544 1375671.1 BE147225 BE1 47205 BE147234 

413642 1 381 386.1 BE154837 BE1 54879 BE154S50 BE1S4877 BE154835 BE154349 BE1S4902 BE1S4905 BE1S4S67 BE154g01 BE1S4904 BE154899 

413702 1383899.1 BE170313 BE158339 BE158290 

413854 1395788.1 BE175582 BE175514 BE175505 BE175591 BE175530 

414093 1416417.1 BE544867 BE247720 

414125 1419230.1 BE253197 BE259456 BE254462 

414195 1424854.-3 BE263293.



414210 1426051.1 

414266 1430984J 

414315 143512_1 
1460320J 

, 414540 1460324_-1 

414584 1464068_-1 

414605 1465790_.1 

414606 1465801.1 
414808 1482624_1 
414954 1509857J 
414995 1611738_1 

1540602-1 

415654 154135.1 

416716 1548818J 

415747 1551 89 J 



6151 



1574545J 



417549 168700_1 
417611 168900.1 
417682 1692759.1 
177402.1 
178363.1 
419753 187763.1 
419936 189181.1 
420111 190755.1 



423843 232510.1 

424101 235398.1 

424324 238127.1 

424685 241151.1 

424872 244505_1 

425090 246649.1 

426674 253317.1 

425612 253969.1 

426065 260276.1 

426130 261414 1 

426544 268987.1 

426650 270283.1 

429163 300543.1 

430264 315008.1 

430757 322947.1 

431071 327550.1 

431688 336609_1 

431822 338082 1 

432189 342819 1 

432340 345248.1 



434098 380006J 
434374 384889.1 
434804 393481.1 



436084 41437J 

436532 421802.1 

436843 427748.1 

437096 433006.1 



437152 43386.1 

437229 434947.1 

439031 46798_1 

439152 46920.1 

439518 47334.1 

440051 48426_2 

441194 51193.1 

441369 515636.1 

442257 53699.1 

443161 561306.1 

443175 561882.1 

443198 562655.1 



BE383592 BE261671 

BE267834 BE614180BE514096 

Z24878 AA494098 F13654AA494040M143127 

BE379046 BE395459 



BE387771 BE387954 BE389705 



D81402 C15494 D61078 D61313 D80399 D81520 

C18200 D78681 T8202S 

R20233F12901T74740 

AW968363 AA465492 R34S39 M16S41 1 

F30364F36559T15435 

M381209 AA381245 M167683 

T26661 Z44135 H23016 

H23687H46460 H40239 

BE077308 AL043350 AW962170 AA180251 AA325287 
BE407197 AA182474AA180369 BE275628 BE276131 

H62953 N76608 N72413 
H72974 W28967 

AW934714 BE161007 BE1 52500 AW749902AW749864 BE162498 BE161005AA190449AW513465 BE161006 BE1 62499 

AW997313AA195805 
AA203661 R89136 

AW993983 AW994798 AW993990 AW993999 AW993989 AA204756 

W69561 R08486 R8718J 

AW749855 AA225995 AW760208 AW750206 

AA227394AA641866AW750732 

N42531 W25700AA249574AA569553 



AA828125AA834883AA330555 

AA332652 AA331633 AW999369 AW902993 BE170475 AA378845 AW964175 AI475221 

AA335394 AA335535 AA335244 AW966148 

AA346316 BE16Q193AA338802AW954536 

AA464840 AA343628 

AA347923AA347928AW961759 

AA350552R21667AW9S3258 

AA359663 AA369654 AW963124 

BE004257 AW81 1 1 90 AM60576 BE172402 BE181703 

N32049R34821 R78237 

AA853282BE266688AA370481 

AA492326 AA503675 AA381 181 

AA332814 AA402411 AA412355 

AA834r66 AW974271 AA592975AA'!<7312 

AA470519 BE303010 BE302954 BE384120 

AI458623 AA639708 AA485409 R22065 AA485570 

AA491379 H86020AW96gi48 

AA513906AA847734AI357044 

AA516049AW004922 

AA527941 AI810608 AI62C190AA635266 



AA534489 AW970240 Am70323 

AA650114AW974148AA572946 

AW877277AW811294 

AW772282AA592974 

AA612960AI934769T12348 

AA625499AA625269AA625184 

AA631439 AI086355 AI082577 



AW974a92AA654375 
AA682622BE141696 
AK000185AW841262 



AA824588 AA732269 AW977146 
AA744406 AA745347 AA745535 

AA730977 AI261584AA334473 Z43283AW875861 AWg38044 BE150701 AW936262AA306862BE565575 BE567380 AA728920AA167612 

AI239729AI251752AA485791 BE568425 AW962958 

AL050027BE089051 

AW976005 AW419264 AA747275 AA810377 
AF075079 H48601H48795 
H65014AF086007 H65015 
W76326AF086341 W72300 

BE559980 BE397203 BE268207 BE559754 BE267725 BE513654 BE267742 BE268219 BE267665 BE561356 

BE274581 BE275382AA703515BE166690 

AA931535A1453601 Z44913 

AW503831 AW503317 BE565665 

AI038316Ai344631AI261653 

N57863AI038952W90167N64103 

AI039813 AI684642 Z40121 AI951414 BE501049 

AI076123AI244834AI695239
447252
446516 
449104 
450613 



452507 
452542 
452563 
452947 



454822 
4S4874 
454913 



714160J 
766241 J 
798149.1 
840016.1 



R14973R14967AI081005 

AV646352 AV662121 AV652008 

AW866632AIC89351 D61942 

Z44811 R13709AV652749AW814906AA084016 

AI214671 Z45244 H24136 R25934 

BE294357 N36568AI217006 

AI374927AI278330AI301383 

R90916AL120023 R18429Z42095 AI369730 R90824 



R08702 R09864AI630313 
AI7020S5R89204 R86260 
BE144949 BE144ggi BE1 44990 AI832igg 



1227504J 
1228081J 
12291C6J 
1230250J 
1236369.1 
1238484.1 
1242236.1 
1245577.1 



1255227.1 
1259973J 
1272255.1 
1290546.1 
1291505.1 

1335196.1 
1348141.1 
1349206J 
1352512.1 



1398666.1 
1470256.1 
151654.1 



333127.1 
371514.1 
393566J 



AI904646 BE179494 BE179421 

AW812256AW812257AI906423AI906422 

AI907562 C03707 C02870 

AW130413AI932362 

AL040021 AL04C037 

AL1 57568 

AW177335 AW177352 AW177340 AW177378 AW177339 AW177388 AW177393 

AAC76811AWB14764 

AW752571 AW847602AA077979 

AW752710 BE180336 BE180186 

AW801365 AW801435 AW801372 

AW807073 AW807055 4W807067 AW807276 AW807030 AW807363 AW845892 AW807091 AW80727S AW807284 AW807287 AW845891 
AW807195AW807271 

AW809109 AW809112AW809122AW809126AWB09128AW809133AW809131 AW809113 AW809111 AW809132 

AW809648 AW809704 AW809643 AW809653 AW809709 AW809949 AW809939 AW81 0010 AW80g705 AWSOgSSO AW809822 AW809667 

AW810093 AW810076 AW809673 AW810349 AW809895 

AW81 1380 AW81 1385 

AW81 2227 AW81 2294 AW812092 

AW854639 AW854719 AW8S4718 BE145880 AW854692 BE145866 AW816154 AW854698 AW854654 AW813335 AW854699 
AW814989 AW814852 AW814808 

AW833793 AW833799 AW833346AW833371 AW833795 AW833562 AW833667 AW833377 

AW836407 BE175600 BE17557g 

AW841462 BE156657 BE156668 BE092475 

AW846080 AW846074 AW8461 18 AW846130 

AW847647 AW84765g AW847656 AW847653 AW847717 AWB47786 



AW865450 AW8651ig AW865452AW866461 AW865325AW865114AW865116 AW865321 AW8655gO AW8653gO 
AW977806 AW887923 AW886321 

AW939445 AW939465 AW93g604AWg3g531 AWg3g530AW939993 

AW945964 AW946020 AW946034 AW946027 AW946041 AW946044 AW946033 AW946024 AW946021 AW946029 AW946015 AW946016 



BE060469 BE080474 BE0e0477 BE080546 BE080545 

BE146879 BE146914 BE146918 
BE147225 BE147205 BE147234 

BE153275 BE153189 BE153329BE153022BE153030BE152974 

BE17g086 BE179084 BE17gOB6 BE179264 

H54381 H54463BE393262 

AA15g478 AW901089 AA160437 AW5g3155 

N51636T51874T51829 

AA225423 AA225369 BE144153AW801549 

AWB97265 AW897274 AL119504 AW897275 AW897270 AW897312 AW897318 AW897317 AA317240 AW961361 T06241 AA326794 

AL138130AVM07975AW999277 

AA504a60AA504911 

AA602711 BEa78290 

AW974687 AA649656 AA652145 

AW975133 AA729943 AA805813 

AW81 6379 AA888282 AA879046 AA879195 



TABLE 1C: 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495 
Strand: Indicates DNA strand from which exons were predicted 
Nt_position: Indicates nucleotide positions of predicted exons 
Nt_position 
18398-18573 
25013-25127 



400623 7228177 Plus 

400709 7249204 Plus 

400749 7331445 Minus 

400842 1927148 Plus 

400925 7651921 Plus 

400964 7139719 Minus 

400968 7923967 Plus 

400975 7139779 Minus 

400983 8081198 Plus 

401032 8117525 Minus 



401129 8699792 
401200 9743387 
401213 9858408 
401230 9929527 
401245 4827300 
401260 8076883 
401269 8954206 
401263 9800093 
7381770 
7534110 
7705251 
7770649 
7229804 
7689963 
7249190 
7249190 
7263888 

7770606 
8122429 
7528100 
8072415 

8117361 
8117407 
7704961 
8576119 



401497 
401508 
401521 
401530 
401575 
401604 
401780 
401781 
401793 

401362 
401881 
402018 
402046 

402071 
402075 
402131 



6598824 
9796341 
9796347 
9797406 
7630937 
9838066 
7230225 
9884928 
9931216 
9931231 
9958129 
8077024 



6136940 
5010175 
9583237 



8954241 
9838213 
8018055 
8568877 . 



9162-9293 
90462-90673 

38183-38391,4390044086 

155282-155403 

19938-20043 

108473-108847 

107903-108832 



78449-79425 
194659-195179 

62022-62242,62326-62451.62543^2710,63072-63167 

111586-111806,114791-114916,115419-115583,116351-116446,116847-116907,122863-123067, 

98243-98380,98489-98619 

33835-34006,3<539-34592,36461-36745,4892549098.5260452758 



4725647456-, 

92607-92813 

110779-110983 

9127-9234 

41468-42406 



119835-120185 

28397-28617.28920-2904529135-29296,29411-29567,29705-29787,30224-30573 
83215-83435,83531-83656,83740-83901,84237-84393,84955«037,8629&«814 

102945-103083 
107548-108298 

148470-1486151 , 153418-153618,154282-154438 



M22035,122804-122921,124019-124161,124455-124610.125672-126076 

33114-33209,33496-33678 
8124-8285 

3261-3834,3939-4269 

22587-23723 

36758-37953 

4660946662,4675846811.8629^86346.89776-89829,90048-90101.102817-102924 



31753-31966 

115658-116580 

20167-22383 

70318-70846 

56847-57055 

8725-8859 

76515-76690 

147744-147861 

131034-131794 

43921-44049,46181-46273 

69821-75323 

194384-194645 

124035-124321 

70527-71019 

71615-71773,73930-74144 

163070-163351 

171964-172239 

177083-177373,177464-177751 
142560-142726 
133809-134099 
137205-137350 



403647 
403687 
403691 
404270 
404340 
404410 
404599 



7630932 Minus 

7658395 Minus 

8083636 Minus 

7249249 Minus 

7331467 Minus 

7711864 Minus 

8568150 Plus 

2996642 Plus 



7342122 
8705107 
9796751 
9797142 
8081050 
7882827 
6580415 
7706886 
6850447 
7331420 



4662677 
7708797 
8576241 
7230083 
£731245 
3980473 



405465 
405472 
405647 
405576 
405621 
405636 
405675 
4057C8 
405771 
405783 
405793 



405917 

405982 
406030 
406053 



7705118 
7712162 
8247790 



149466-149665 

7210-7414,10043-10195 

184131-184295 



86147-86509 

3649-3750,4161-4306,5962-6049,6849-6965 
10893-11606 

49052-49176,56177-56273,59384^9488 

110443-110733 

99433-99528,100035-100161 

104257-105215 

115534-115747 



48711-49524.^ 

106104-106199,111659-111781 
11115-11562 



23471-23572 

55744-55903,57080^7170,61478-61560 

33227-33442 

33267-33563 

105008-105142,105980-106091,140445 140556,142519-142641 

31923-32311 

133807-134053 

8935-9073,12242-12367,13364-13506,14965-15493 

106297-106447,108462-108596 

124361-124520,124914-125050 

84000-85009 

59362-59607 



70304-70630 



91191-91254,91510-91589 



19271-19 



28135-23244 
26407-27151 
74553-75173 
57311-57874 
16375-16584 
106829-107213 



96123-96547 



7711360 
7711508 
8224211 
4567182 



4769-5490 

130230-130418 

21251-21525 

107068-107277 

45576-46757 

38306-38989 



TABLE 2A lists about 187 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to be extracellular or cell-surface proteins. These were selected 
as for Table 1A, except that the ratio was greater than or equal to 2.5, and the predicted protein contained a PFAM domain that is indicative of extracellular localization (e.g., ig, 
fn3, egf, 7tm domains). 
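
The selection rule stated above (ratio of at least 2.5 plus an extracellular-type PFAM domain) can be sketched as a simple filter. This is only an illustrative sketch: the row type, the function names, and the example rows are hypothetical, and the domain set contains only the four examples named in the text (ig, fn3, egf, 7tm), so it is presumably narrower than the list actually used to build the table.

```python
from dataclasses import dataclass
from typing import List

# Assumption: only the four domain families named in the text are listed here;
# the full set used for Table 2A is not given in the source.
EXTRACELLULAR_PFAM = {"ig", "fn3", "egf", "7tm"}


@dataclass
class ProbesetRow:
    pkey: int            # Eos probeset identifier (hypothetical values below)
    title: str           # UniGene title
    pfam_domains: str    # semicolon-separated PFAM domain names
    ratio: float         # tumor vs. normal tissues


def has_extracellular_domain(pfam_domains: str) -> bool:
    """Return True if any listed domain belongs to the assumed extracellular set."""
    for name in pfam_domains.split(";"):
        name = name.strip().lower()
        # Variants such as "7tm_1" or "7tm_3" are treated as 7tm domains.
        if name in EXTRACELLULAR_PFAM or name.startswith("7tm"):
            return True
    return False


def select_table_2a(rows: List[ProbesetRow], min_ratio: float = 2.5) -> List[ProbesetRow]:
    """Keep rows whose ratio is >= min_ratio and that carry an extracellular-type domain."""
    return [
        r for r in rows
        if r.ratio >= min_ratio and has_extracellular_domain(r.pfam_domains)
    ]


# Hypothetical example rows, not taken from the table.
rows = [
    ProbesetRow(1, "example A", "ig", 3.6),
    ProbesetRow(2, "example B", "7tm_1", 2.5),
    ProbesetRow(3, "example C", "EGF;laminin_G", 2.4),   # ratio too low
    ProbesetRow(4, "example D", "pkinase", 4.0),         # no listed domain
]
selected = select_table_2a(rows)
print([r.pkey for r in selected])
```

The threshold is inclusive because the text says "greater than or equal to 2.5".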

TABLE 2A: ABOUT 187 UP-REGULATED OVARIAN CANCER GENES ENCODING EXTRACELLULAR/CELL SURFACE PROTEINS 

Pkey: Primekey 

Ex. Accn: Exemplar Accession 

UGID: UniGene ID 

Title: UniGene Title 

PFAM domains 

ratio: tumor vs. normal tissues 
Ex. Accn No. 
AW178761 
AA938471 



60 

65 



467148 
428568 
412170 
442666 
403763 



UGID Title 

Hs.227948 serine (or cysteine) proteinase inhibito 
Hs.115242 developmentally regulated GTP-bindi 
Hs.1925 desmoglein 3 (pemphigus vulgaris ant 
Hs.89546 selectin E (endothelial adhesion molec 
gt)3(B0ID4.x1 Na_CGAP_Gas4 Horn 



HS.2B39 
Hs.1 15263 
Hs.155400 ESTs 
Hs.79197 " " 
Hs.76391 



403093 
446740 
405547 



402745 
452755 
421459 
416151 



AI611635 
AW937435 



AI821539 Hs.97249 



CD83 antigen (activated B lymphocyt 
myxovirus (influenza) resistance 1, ho 
predicted exon 



gb:QV3-DT0044-221299-045-b09 DT 
predicted exon 
predicted exon 
Hs.213436 ESTs 



AI281848 
BE149762 
NM_003513 



gb:AB65C7R Infant brain, LUJL ara 

Hs.165547 ESTs 

Hs.248213 gap junction protein, beta 6 (connexin 



415451 
403083 
448995 
418345 
424966 
431211 
430563 
45ai52 
418344 
403089 
403687 
403691 
414035 
421284 
435435 
457122 



431117 
457948 
435174 
408170 



H19415 

AI613276 

AJ001696 

AU077312 

M86849 

AA4B1269 

AI138635 



Hs.191510 ESTs, Weakly similar to ORF2 [M.m 
gb:RC-BT091-210199J98 BT091 Ho 
gb:6013C1552F1 NIH MGC^I Horn 
Hs.65370 lipase, endothelial 

predicted exon 
Hs.268720 ESTs, Moderately similar to ALU1_H 

predicted exon 
Hs.5662 guanine nucleotide binding protein (G 
Hs.241407 serine (or cysteine) proteinase inhibito 
Hs.153985 solute carrier family 7 (cationic amino 
Hs.5566 gap junction protein, beta 2, 26kD (co 
Hs.178381 ESTs 
Hs.22963 ESTs 

Hs.1200 arachidonate 12-lipoxygenase 



Y00630 

U62435 

T89473 

A1026157 

X14767 

NM.016112 

AF003522 

AI498640 



Hs.75716 
Hs.103128 
Hs.192328 
Hs.33728 
Hs.89768 
Hs.169241 
Hs.250500 
Hs.159354 
Hs.194624 
Hs.31835 
IHs.igi852 
Hs.2784e5 



ESTs, Weakly similar to ALU1_HUM 
gamma-aminobutyric acid (GABA) A 
polycystic kidney disease 2-like 1 
delta (Drosophila)-like 1 



AJ271643 Hs.87469 



ESTs, Weakly similar to ALU1_HUM 
olfactory receptor, family 1, subfamily 
ATP-binding cassette, sub-family B (M 
predicted exon 

transforming growth factor, alpha 
serine (or cysteine) proteinase inhibito 
solute carrier family 2 (facilitated gluc 
tissue factor pathway inhibitor 2 
luteinizing hormone/choriogonadotrop 
putative acid-sensing ion channel 
adrenergic, beta-2-, receptor, surface 



Hs.67846 leukocyte immunoglobulin-like recep 
gb:zm61g01.r1 Stratagene fibroblast ( 

Hs.196491 ESTs 

Hs.64311 a disintegrin and metalloproteinase do 



NM.001942 

AI750878 

AI264671 

AF0gi035 

AC004755 



Hs.288541 Homo sapiens mRNA for KIAA1558 

Hs.23672 low density lipoprotein receptor-relate 

Hs.159903 olfactory receptor, family 6, subfamily 

Hs.20924 KIAA1628 protein 



Hs.87409 

Hs.164166 ESTs 

Hs.184627 KIAAOIIS protein 

Hs.184922 one cut domain, family member 3 

Hs.73729 very low density lipoprotein receptor 




ion_trans;K_tetra 
A2M;A2M_N 

EGF;ldl_recept_a;ldl_recept_b 



EGF;ldl_recept_b;thyroglobulin_1 



fn3;ldl_recept_a;ldl_recept_b 
ig;SPRY 

ABC_tran;ABC_membrane 




sugar_tr 

Kunitz_BPTI;G-gamma 



A2M;A2M_N 

EGF;ldl_recept_a;ldl_recept_b 
ldl_recept_a;EGF;ldl_recept_b 



E1-E2_ATPase 

EGF;ldl_recept_a;ldl_recept_b 

ank;death;RHD;TIG 
403074 






predicted exon 


fn3 


3.8 


413605 


BE152644 




gb:CM1-HT0329-250200-123-f09 HT 


alpha-amylase 


3.8 


442295 


AI827248 


Hs.224398 


Homo sapiens cDNA FLJ11469 fis, c 


Collagen;COLFI 


3.7 


403661 






predicted exon 


7tm_3;ANF_receptor 




407305 


AA715234 




gb:nv35l03j1 NCI_CGAP_Br5Honi 


pkinase;Sema;Plexin_repeat;TIG 




457353 


X65533 


Hs.248144 


melanocortin 2 receptor (adrenocortic 


7tm_1 




431176 


AI026g84 


Hs^93662 


ESTs 


laminin_EGF;laminin_B 


3.6 


436233 


AI742878 


Hs.124116 


ESTs 


ig 


3.6 


431808 


M30703 


Hs.270833 




EGF 


3.6 


445798 


NM_012421 


Hs.13321 


rearranged L-myc fusion sequence 


zf-C2H2 


3.6 


400380 


NM 018485 


Hs.283079 


G protein-coupled receptor C5U 


7tm_1 


3.6 


463893 


NM_000836 


Hs.36461 


glutamate receptor, ionotropic, N-met 


lig_chan 


3.5 


409402 


AF208234 




cystatin B (stefin B) 


cystatin 


3.5 


421166 


AA306407 


Hs.102308 


potassium inwardly-rectifying channe 


IRK 




445575 


Z25363 


H3.1 72004 




thS 


3.5 


428957 


NM_003881 


Hs.1 94679 


WNT1 inducible signaling pathway p 


tsp_1;vwc;IGFBP 






NM_016266 


Hs.95260 


Homo sapiens mRNA; cDNA DKFZp 


Na_H_Exchanger 




403077 






predicted exon 


thS 




455612 


BE042895 


Hs.274848 


ESTs 


ABC_tran;ABC_membrane 




424091 


AF235097 


Hs.139263 


calcium channel, voltage-dependent, a 






403956 


W28077 


Hs.79389 


nel (chicken)-like 2 


cadherin;Cadherin_C_term 




457470 


AB040973 


Hs.272385 


G protein-coupled receptor 72 


7tm_1 




401622 


N47812 


Hs.81360 


CGI-35 protein 


disintegrin;Reprolysin 




404886 






predicted exon 






437692 


AA176959 






fhS 




407944 


R34008 


Hs.239727 


desmocollin 2 


cadherin 




407393 


AB038237 




gb:Homo sapiens mRNA for G protei 


7tm_1 




436936 


AL134451 


Hs.197478 




EGF;laminin_G 




423309 


BE006775 


Hs.126782 


sushi-repeat protein 


sushi;HYR 




402172 








ig 


3.3 


447420 


AI378628 




gb;lc72g07.x1 Soares_NhHMPu.S1 H • 
ESTs 


ank;pkinase;death 


3.3 


438901 


AF085834 


Hs.29036 


sushi 


3.3 


424362 


AL137646 


Hs.146001 


Homo sapiens mRNA; cDNA DKFZp 


trypsin;sushi;CUB 


3.3 


430453 


BE387060 


Hs.3903 


Cdc42 effector protein 4; binder of Rh 


Ih3 


3.3 


416631 


H6g466 




gb:yr88f07.r1 Soares fetal liver spleen 


ldl_recept_a;MACPF 


3.3 


453174 


AI633529 


Hs.135238 


ESTs 


7tm_1 




433848 


AF095719 


Hs.93764 


carboxypeptidase A3 


Zn_carbOpept;Propep_M14 


3.2 


408546 


W49512 


Hs.45348 


bradykinin receptor B1 


7tm_1 


3.2 


423573 


AA328504 




gb:EST31993 Embryo, 12«iieekl Horn 


7tm_1 


3.2 


458662 


AI823410 


Hs.169149 


l<aryopher1n alplia 1 (importin alpha 5 


7tm_3;ANF_receptor 


3.2 


433430 


AI863735 


Hs.1 86755 


ESTs 


thyroglobulin_1;IGFBP 


3.2 


438860 


R33727 


Hs.24688 


EST 


ank;pkinase;death 


3.2 


420783 


AI659338 


Hs.99923 


lectin, galactoside-binding, soluble, 7 


Gal-bind_lectin 


3.2 


409968 


U56102 


Hs.57699 


adhesion glycoprotein 




3.1 


430630 


AW269920 


Hs.2621 


cystatin A (stefin A) 


7tm_3;ANF_receptor 


3.1 


420737 


LC8096 




tumor necrosis factor (ligand) superfa 


TNF 


3.1 


422279 


H69644 


Hs.1 14231 


C-type lectin-like receptor-2 




3.1 


400289 


X07820 


Hs.2258 


matrix metalloproteinase 10 (stromely 


hemopexin;Peptidase_M10 


3.1 


412597 


AU077051 


Hs.74561 


alpha-2-macroglobulin 




3.1 


453420 


AJ00345g 




gb:AJ00345g Selected chroirasome 2 


IRK 


3.1 


404243 








zf-C3HC4;SPRY;zf-B_box 


3.1 


449987 


AW079749 


Hs.184719 


ESTs, Weakly similar to AF116721 1 


ABC_tran;ABC_membrane 


3.1 


422471 


AA311027 


HS271894 


ESTs 


■<s 


3.0 


400464 








Peptidase_S9 


3.0 


458713 


BE044496 


HS.2&2707 


ESTs 


EGF 


3.0 


421340 


F07783 


Hs.1369 


decay accelerating factor for complem 




3.0 


449523 


NM_000579 


Hs.54443 


chemokine (C-C motif) receptor 5 


7tm_1 


3.0 


400704 






predicted exon 


lig_chan;ANF_receptor 


3.0 


416239 


AL038450 


Hs.48948 


ESTs 


E1-E2_ATPase;Hydrolase 


3.0 


433664 


AW292176 


Hs.245834 




Ricin_B_lectin 


3.0 


42994 




Hs.1724 


Interleukin 2 receptor, alpha 




2.9 


447726 


AL137638 


Hs.19368 


Homo sapiens mRNA; cDNA DKFZp 




2.9 


425483 


AF231022 


HSJ01273 


Homo sapiens protocadherin Fat 2 (FA 


EGF;cadherin;laminin_G 


2.9 


423513 


AF035960 


Hs.129719 


transglutaminase 5 


Transglut_core;Transglutamin_N 


2.9 








predicted exon 


ig;pkinase;LRRNT;LRRCT 


2.9 


405790 






predicted exon 


Sema;Plexin_repeat;TIG 


2.9 


422669 


H12402 


Hs.119122 


ribosomal protein L13a 


arfyas;RibosomaLS17 


2.9 


430793 


M83181 


Hs.247940 


5-hydroxytryptamine (serotonin) recep 


7tm_1 


2.9 


403411 






predicted exon 


ABC_tran;ABC_membrane 




428188 


M98447 


Hs.22 


transglutaminase 1 (K polypeptide ep 


Transglutamin_N;Transglut_core 


2.8 






Hs.76252 


endothelin receptor type A 


7tm_1 


2.8 


427223 


BE203189 


Hs.174031 


cytochrome c oxidase subunit VIb 


COX6B 


2.8 


404187 






predicted exon 


ig 


2.8 


443537 


D13305 


Hs.203 


cholecystokinin B receptor 


7tm_1 


2.8 










vwa;integrlriJVP2}0^eptor 




411213 


AA676939 


Hs.69285 


neuropilin 1 


CUB;MAM;F5_F8_type_C 


2.7 




BE328153 


Hs.240087 


ESTs 




2.7 


401244 






predicted exon 


vwa;vwd;TIL 




458930 


NM.003612 


Hs.24640 






2.7 


434411 


AA632649 


Hs.201372 


ESTs 


sushi 


2.7 


400421 


AF263537 


Hs.287370 


fibroblast growth factor 23 


FGF 


2.7 


448999 


AF179274 


Hs.22791 


transmembrane protein with EGF-like 


kazal 


2.7 


417350 


U50928 


Hs.82001 


polycystic kidney disease 2 (autosoma 




2.6 


419452 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


pkinase;ig 


2.6 
431728 
441595 
445537 
447197 



416429 
417067 
433182 
403092 



M007536 
H54658 
AJC01417 
AB039920 



gb:yh88b01.s1 So 
Fc fragment of IgA, receptor for 
ESTs, Moderately similar to ALU1_H 



Hs.172928 

438698 AW297855 Hs.125815 
456816 NM_013348 Hs.144011 

TABLE 2B: 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



potassium inwardly-rectifying channe 



ABC_tran;ABC_membrane 



409385 112523J 



AA071267 T55940 T64615 AA071334 



423573 
447197 
447420 

452947 
453420 
459170 



229714_1 
711623.1 
721207„1 
920646 1 
939810 1 
966433.1 



BE152644 BE152712 BE1S2668 BE152S59 BE152810 BE15281 1 BE152B16 BE1S2643 BE152706 BE1526S6 BE152660 BE152715 BE152662 
BE152669 BE152661 BE152672 BE152S53 BE1S2716 BE152651 BE152767 BE1S2677 BE152652 BE152714 BE152708 BE152665 BE15267g 
BE152771 BE152775 BE152666 BE152768 BE152813 BE152664 BE152676 BE152681 BE152709 BE152667 BE152814 BE152808 BE152711 
BE1S2707 BE1S2815 BE1S2678 BE152S73 BE1S2782 BE1S2671 BE152S82 BE152760 BE152809 BE152778 BE152780 BE152762 BE1S2776 
BE152781 BE15Z774BE15Z763BE15276g 
T26661Z44135 H23016 






Aia78628 N32350 H85772 

AI9Q551 8 AI90551 6 AI905457 AI905515 AW176013 AW176037 

AW130413AI932362 
AJ003459AJ003461 

AI90551 8 AI90551 6 AI905457 AI905515 AW176013 AW176037 



TABLE 2C: 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of 

human chromosome 22" Dunham, et al. (1999) Nature 402:489-495 
Strand: Indicates DNA strand from which exons were predicted 
Nt_position: Indicates nucleotide positions of predicted exons 
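
The Nt_position column packs one or more predicted exons into a single field as comma-separated start-end pairs. A minimal parsing sketch follows. The function names are hypothetical, and the code assumes that both coordinates are inclusive, which the legend does not state explicitly.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split an Nt_position field such as '177083-177373,177464-177751'
    into a list of (start, end) integer pairs, one per predicted exon."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start_text, end_text = part.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in range {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Example using one Nt_position entry that appears in the table.
exons = parse_nt_position("177083-177373,177464-177751")
print(exons)
print(total_exon_length(exons))
```

Entries that were damaged in extraction (for example a missing hyphen or a stray character) raise an error here, which makes them easy to flag for manual checking against the source sequence record.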



400464 
400704 
400749 
401244 
401537 
401657 
402172 



403687 
403691 
4C37S3 
403795 



Ref 



9100664 
8575911 
9796347 
9212200 

8954241 
8954241 
8954241 
8954241 
8954241 



8705027 
7387384 
7387384 
7229888 



Nt_position 
22074-22214 
63110-63241 



171964-172239 

174720-175016,175104-176406,175508-175813 

177083-177373,177464-177751 

104247-104420 

30268-30482 

9009-9534 



405281 
405285 



4884062 Plus 

6139075 Minus 

5139075 Minus 

1054740 Plus 

1054740 Plus 

5123990 Plus 



43575-43887 
75073-77664 
7644-7991 
74695-75123 

3649-3750,4161-4305^962-6049.6849-6965 



34202-34351,35194^6,46412-45475,45731-45958,4729&47457,49549-49658,4979049904,50231-50342,53583-S3667,54111- 
54279 

55744-55903,57080-57170,61478-61560 

118677-118807.119091-119296,121626-121823 

124361-124520,124914-125050 
405790 12039S8 Plus 



136364-136509,136579-136699,136805-136941 



These were selected as for Table 1A, except that the ratio was greater than or 



UGID: UniGene ID 
Title: Unl 
PFAMdoi 



40 
45 



Pkey Ex. Accn No. 

420859 AW468397 

422166 W72424 

422158 L10343 

424799 BE550723 

442402 NM_000954 

408522 AI641214 

431369 BE184455 

430520 NM_016190 

428471 X57348 

421978 AJ243662 

437191 NM_006846 

407788 BE514982 

441565 AW953575 

431211 M86849 

419329 AY007220 

430572 U33114 

417079 U65590 

412636 NM_004415 

417515 L24203 

426295 AW367283 



416839 AW250318 

409453 AI885516 

424670 W61215 

417130 AW276858 

423634 AW959908 

4423^9 NM.004613 

456898 NM_001928 

423017 AW178761 

447990 BE048821 

424362 AL137646 

414438 AI879277 

420136 AW801090 

433336 AF017986 
403741 

430637 BE160081 

424098 AF077374 

441591 AF055992 

426521 AF161445 

406713 U02629 

4C6725 D51245 

422168 AA586894 

406755 N80129 

425593 AA278921 

442257 AW503831 

421957 AW068637 

447626 AL048763 

406722 H27498 

427223 BE208189 

41442C AA043424 

417259 AW903838 

414191 AW250089 

436906 H96990 

408000 L11690 

414035 Y00630 

432706 NM_013230 

421948 L42583 

414662 AU)36058 

425071 N013989 
404767 

418327 U70370 

436729 BE621807 

414183 AW957446 
400163 

433423 BE407127 

423457 F08208 



UGID Title 

Hs.100000 S100 calcium-binding protein A8 (calgranulin A) 

Hs.112405 S100 calcium-binding protein A9 (calgranulin B) 

Hs.112341 protease inhibitor 3, skin-derived (SKALP) 

Hs.153179 fatty acid binding protein 5 (psoriasis-associated) 

Hs.8272 prostaglandin D2 synthase (21kD, brain) 

Hs.46320 Small proline-rich protein SPRK [human, odontogenic k 

Hs.251754 secretory leukocyte protease inhibitor (antileukoprotein 



HS.184S10 

Hs.1 10196 

HS.5476 

Hs.38991 

Hs.169902 

HS.SS66 

Hs.288998 

Hs.245188 

Hs.81134 

Hs.74316 

Hs.82237 

Hs.75839 

Hs.262958 

Hs.77385 

Hs.77385 

Hs.278613 



epithelial V-like antigen 1 

S100 calcium-binding protein A4 (calcium protein, calv 

heparin-binding growth factor binding protein 
transglutaminase 2 (C polypeptide, protein-glutamine-g 
D component of complement (adipsin) 
serine (or cysteine) proteinase inhibitor, clade B (ovalbu 
small inducible cytokine subfamily A (Cys-Cys), memb 
Homo sapiens mRNA; cDNA DKI^86I^824 (from 

actin, alpha 2, smooth muscle, aorta 

ESTs, Highly similar to JE0174 feled protein-2 [H.sa 

predicted exon 

S100 calcium-binding protein A11 (calgizzarin) 
small proline-rich protein 3 



NICE-1 protein 

serine protease inhibitor, Kazal type, 5 

S100 calcium-binding protein A2 

solute carrier family 2 (facilitated glucose transporter), 

gap junction protein, beta 2, 26kD (connexin 26) 

S100-type calcium binding protein A14 

tissue inhibitor of metalloproteinase 3 (Sorsby fundus d 

interleukin 1 receptor antagonist 

desmoplakin (DPI, DPII) 

ataxia-telangiectasia group D-associated protein 

zinc finger protein 6 (CMPX1) 

ESTs, Weakly similar to alternatively spliced product u 

myosin, light polypeptide 6, alkali, smooth muscle and n 

myosin, light polypeptide 6, alkali, smooth muscle and n 

interferon, alpha-inducible protein 27 



Hs.227948 
Hs.20144 
Hs.146001 
Hs.76136 
Hs.195851 
Hs.31386 



S100 calcium-binding protein A7 (psoriasin 1) 



Hs.1908 proteoglycan 1, secretory granule 

gb:UI-HF-BN0-alb-b-05-0-UI.r1 NIH_MGC_50 Homo 

Hs.10gS57 hypothetical protein DKFZp434H0820 

Hs.340 small inducible cytokine A2 (monocyte chemotactic pro 

Hs.283305 Homo sapiens SNC73 protein (SNC73) mRNA, comple 

Hs.174031 cytochrome c oxidase subunit VIb 

Hs.76095 immediate early response 3 

Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 

Hs.75807 PDZ and LIM domain 1 (elfin) 

Hs.181244 major histocompatibility complex, class I, A 

Hs.620 bullous pemphigoid antigen 1 (230/240kD) 

Hs.75716 serine (or cysteine) proteinase inhibitor, clade B (ovalbu 

Hs.286124 CD24 antigen (small cell lung carcinoma cluster 4 antig 

Hs.1117S8 keratin 6A 
Hs.76807 
Hs.154424 

predicted exon 

Hs.84136 paired-like homeodomain transcription factor 1 

Hs.301711 ESTs 

predicted exon 

Hs.8997 heat shock 70kD protein 1A 

Hs.155606 paired mesoderm homeo box 1 
414085 M114016 

423189 M59371 

438240 N92638 

417366 BE185289 

412774 M120865 

407242 M1B728 

431292 AA370141 
403695 

417365 D50683 

432331 W37862 

424479 AF064238 

444726 NM_006147 

432314 AA533447 

429500 X78565 

441406 Z45957 

412969 A1373162 

423720 AU)44191 
400111 

407207 T03651 

417164 AA333283 

424971 AA479005 

439394 AA149250 

406657 AI678644 

451092 AI207256 

412596 AA161219 

422103 AA984330 

428785 AlOISgsa 

450988 BE613571 

414622 AI752666 
405022 

408221 AA912183 

446500 U78093 

421416 BE302950 

412247 AF022375 

410541 AAC65003 



406825 AI982529 

443623 AA345519 

404201 AF059556 
405138 

408733 AW264812 

414044 BE614194 

430152 AB001325 

428121 AB006622 

434311 BE543469 
406140 

432918 AF077200 

420107 AL043980 



Hs.171696 EphA2 

Hs.124004 ESTs 

Hs.1076 small proline-rich protein 1B (cornifin) 

Hs.23136 ESTs 

gb:Human nonspecific crossreacting antigen mRNA, co 

Hs.251453 Human DNA sequence from clone 967N21 on chromos 

Hs.82028 
Hs.274368 
Hs.1490g8 
Hs.11801 



Hs.285173 ESTs 
Hs.289114 
Hs.7837 
Hs.75103 
Hs.23388 



Interferon regulatory factor 6 



Homo sapiens cDNA: FLJ21310 fis, clone COL02160 
predicted exon 
tubulin, beta polypeptide 



inucleoprotelnAJB 



tumor suppressing subtransferable candidate 3 
ESTs, Weakly similar to WDNM m WDNM1 PROT 
major histocompatibility complex, class I, C 
Homo sapiens mRNA for FLJ00074 protein, partial cds 
diphtheria toxin receptor (heparin-binding epidermal gro 
protein kinase H11; small stress protein-like protein HS 
ESTs 

ATP synthase, H+ transporting, mitochondrial F0 comp 



Hs.179661 
Hs.81361 
HS.154C36 
Hs.56105 
Hs.277477 
Hs.13766 
Hs.799 

HsJ 25265 

Hs.429 

Hs.76669 



Hs.47447 ESTs 
Hs.15154 
Hs.104125 

Hs.73793 vascular endothelial growth factor 
Hs.64179 hypothetical protein 
Hs.77961 major histocompatibility complex, class I, B 
Hs.94789 ESTs 

Hs.84298 CD74 antigen (invariant polypeptide of major histocom 
Hs.9641 complement component 1, q subcomponent, alpha poly 
Hs.103983 solute carrier family 5 (sodium iodide symporter), mem 

predicted exon 
Hs.254290 ESTs 
Hs.75721 profilin 1 
Hs.234642 aquaporin 3 

Hs.182536 Homo sapiens cDNA: FLJ21370 fis, clone COL03092 
Hs.266263 Homo sapiens cDNA FLJ14115 fis, clone MAMMA10 

predicted exon 
Hs.279813 hypothetical protein 
Hs.7886 pellino (Drosophila) homolog 1 
Hs.180370 cofilin 1 (non-muscle) 

Hs.11081 ESTs, Weakly similar to S57447 HPBRII-7 protein [H. 

■ SCDNAFU11346 lis, clone PLACE1010 
hypotheUcal protein DKFZp761ACS2 
— n, Wealdy slmlar to S57447 HPB[«i-7 protein IH. 



401781 

448257 AW772070 

428415 AA337211 

424206 NM_003734 

406812 AF000575 

425882 U83115 

432501 BE546532 

421786 AI188653 

427981 BE275986 

41C143 AA188169 

451328 AW853606 

414135 Nl^_004419 

414602 AW630088 
4C1785 

411469 TC9997 

419693 AA133749 

417039 AA30218C 

4C6718 AA606625 
4C2543 

408669 A1493591 

414987 AA524394 

445810 AW265700 

406653 AA574074 

407498 U28131 

412524 AA417813 
401521 



Hs.301885 
Hs.184029 
Hs.107125 
Hs.99936 
Hs.247942 

Hs.253146 
Hs.184222 
Hs.198241 
Hs.67346 
Hs.151002 
Hs.287329 
Hs.21351 
Hs.181311 
Hs.288819 
Hs.109012 



tln10(epl 



Down syndrome critical region gene 1 
amine oxidase, copper containing 3 (vascular adhesion p 
leukocyte immunoglobulin-like receptor, subfamily B ( 
absent in melanoma 1 



FU21022 fis, done CAE06383 



Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (from 

predicted exon 
Hs.70327 cysteine-rich protein 2 
Hs.92323 FXYD domain-containing ion transport regulator 3 
Hs.80936 ATP synthase, H+ transporting, mitochondrial F0 comp 
Hs.169476 glyceraldehyde-3-phosphate dehydrogenase 

predicted exon 

Hs.78146 platelet/endothelial cell adhesion molecule (CD31 antig 
Hs.165544 ESTs 
HS.1SS660 ESTs 

Hs.77961 major histocompatibility complex, class I, B 

gb:Human HMGI-C chimeric transcript mRNA, partial 
Hs.11177 ESTs 
Hs.221441 ^gyf^^"" 
Hs.183704 ubiquitin C 
409828 AW601137 

459390 BE3BS72S 

445055 BE512856 

411789 AF246606 

410626 BE407727 

410706 AI732404 

419273 BE271180 

407839 AA045144 

444286 AI625304 

449226 AB002365 

414290 AI568801 
401246 

425222 M85430 

409960 R42678 

437201 F29279 

406566 AF088886 

455426 AWg37792 

415160 T82802 

424996 Z46023 

453870 AW385001 

433470 AW960564 

428188 M98447 

417409 BE272506 

425389 AW974499 

434658 AI62443S 

456562 AA306049 

447111 AI017574 

432360 BE04S243 

424125 M31669 

419968 XQ4430 

429415 NM_002593 

451541 BE279383 

424499 N90344 
402144 

422511 AU075442 
400231 

437712 X04588 

417433 BE270266 

419669 AB023206 

428582 BE336699 

421401 AW410478 

414064 BE245239 

431938 AA93B471 

411930 F06485 

423150 AW950647 



413924 AL119964 

420231 R06866 

434715 BE005346 

422831 R02604 

416354 H40164 ' 

422976 AU076657 

426356 BE636836 

433935 AF112203 

430040 AW503115 

406340 AA299679 

426050 AF017307 

426105 BE280066 



gb:qr30g06.x1 NCI_CGAP_GC6 Homo sapiens cDNA 
Hs.250175 homolog of yeast long chain polyunsaturated fatty acid 
Hs.97199 complement component C1q receptor 
Hs.16036 ESTs, Highly similar to AF151358 1 HSPC0951rtsapi 

gb:UI-HF-BP0p-ait-e-12-0-UI.r1 NIH_MGC_51 Homo 

gb:601276347F1 NIH_MGC_20 Homo sapiens cDNA 
Hs.1090S1 glycoprotein, synaptic 2 
Hs.721S7 Homo sapiens adlican mRNA, complete cds 

gb:601299771F1 NIH_MGC_21 Homo sapiens cDNA 
Hs.68846 ESTs 
Hs.293490 ESTs 
Hs.161566 ESTs 
Hs.190312 ESTs 
Hs.23311 KIAA0367 protein 
Hs.71721 ESTs 



Hs.155191 
Hs.3016e9 
Hs.177486 
Hs.11590 



Hs.22 

Hs.8210 

Hs.192183 

Hs.194488 

Hs.1026e9 

Hs.17409 

Hs.274416 

Hs.93913 
Hs.202097 
Hs.26557 
Hs.149436 



villin 2 (ezrin) 
KIAA0564 protein 

amyloid beta (A4) precursor protein (protease nexin-II, 
predicted exon 

gb:QV3-DT0045-140200-082-b07 DT0045 Homo sapi 
gb:yd38a04.r1 Soares fetal liver spleen 1NFLS Homo s 
gb:HSC2FA041 normalized infant brain cDNA Homo s 
Homo sapiens cDNA: FLJ23173 fis, clone IMG10019 
transmembrane 4 superfamily member 1 
transglutaminase 1 (K polypeptide epidermal type I, pro 
syndecan 1 



DKFZP4340125 protein 
■ •■ -ICnte 



NADH dehydrogenase (ubiquinone) 1 alpha subcomple 
inhibin, beta B (activin AB beta polypeptide) 
interleukin 6 (interferon, beta 2) 
procollagen C-endopeptidase enhancer 
plakophilin 3 



Hs.11 



predicted e» 



ntypeX 



11, alpha 1 



Hs.85844 neurotrophic tyrosine kinase, receptor, type 1 

Hs.82128 5T4 oncofetal trophoblast glycoprotein 

Hs.92186 Leman coiled-coil protein 

Hs.185055 BENE protein 

Hs.104019 transforming, acidic coiled-coil containing protein 3 

Hs.16165 expressed in activated T/LAK lymphocytes 

Hs.115242 developmentally regulated GTP-binding protein 1 

gb:HSC19G051 normalized infant brain cDNA Homo s 

Hs.182684 cytochrome c oxidase subunit VIIa polypeptide 2 (liver) 



Hs.74047 < 
Hs.1573 ! 
Hs.31638 
Hs.75616 
Hs.19813 
Hs.l 16410 



sec61 ho 

gb:601054837F1 NIH_MGC_10 Homo sapiens cDNA 

13kDa differentiation-associated protein 

pM6 protein 

cofilin 1 (non-muscle) 

E74-like factor 3 (ets domain transcription factor, epith 
hypothetical protein FU220S6 

small proline-rich protein 3 

CD74 antigen (invariant polypeptide of major histocom 

enigma (LIM domain protein) 

ESTs 



416973 R24707 



468789 AL157458 
400842 

406828 AA419202 

423287 AL137416 

451383 AW239364 

437042 AK000702 

459399 BE407712 

425650 NM_001944 



Hs.260201 ESTs 
Hs.132219 ESTs 
Hs.25001 

Hs.20157 Homo sapiens cDNA FLJ20848 fis, clone ADKA0V732 
predicted exon 

Hs.84298 CD74 antigen (invariant polypeptide of major histocom 
Hs.126177 Homo sapiens mRNA; cDNA DKFZp4340192 (from c 
Hsi0242 hypothetical protein FLJ12788 
Hs.5420 hypothetical protein FLJ20695 

gb:601299745F1 NIH_MGC_21 Homo sapiens cDNA 
Hs.1925 desmoglein 3 (pemphigus vulgaris antigen) 
412765 AK000620 

419223 X60111 

413796 AW408094 

447796 AW296151 

431103 M57399 

415314 N88802 

428411 AW291464 



Hs.5338 

Hs.74671 

Hs.1244 

Hs.75545 

Hs.163612 

Hs.44 

Hs.5422 

Hs.10338 



407846 
453500 
456054 
45J467 
411794 



405783 
417308 
400749 
413442 
404828 



AL03651B 
AI476427 
BE313241 



F12741 
H50720 
BE140643 
AJ1320e7 



carbonic anhydrase XII 
ADP-ribosylation factor 1 
CD9 antigen (p24) 
interleukin 4 receptor 



pleiotrophin (heparin binding growth factor 8, neurite g 



immunoglobulin heavy constant gamma 3 (G3m marker) 

cathepsin B 

heat shock 105kD 



Hs.43125 
Hs.30089 



HS.28S363 
Hs.25590 
Hs.252189 



gb:601151545F1 NIH_MGC_19 Homo sapiens cDNA 

ESTs 

phosphorylase, glycogen; brain 
ESTs 

gb:QV4«N0090-070400-163KA7 BN0090 Homo sap! 



syndecan 4 (amphiglycan, ryudocan) 
Thy-1 cell surface antigen 

normalized infant brain cDNA Homo 



KIAA0101 gene product 
predicted exon 

gb:RC0-HT001S-310599O1S HT0015 Homo sapiens c 



FXYD domain-containing ion transport regulator 5 
predicted exon 
collagen, type IV, alpha 2 



417055 N39489 

449184 AW29629S 

446542 NM_004281 

412793 AW997986 

452818 W21909 
402869 

436810 AA353044 
402075 

410480 R97457 

406690 IUI29S40 



409703 NM_006187 

446108 AL036596 

428144 BE269243 

445688 AI248205' 
405411 

410275 U85658 

424675 NM_005512

450455 AL117424 

414855 AA15e986 

433578 BE336886 
401994 

445033 AV652402 
402277 

428106 BE620016 

448625 AW970786 

422537 AI879352 

457204 BE264152 

444094 AI695764 

414053 BE391635 

430511 BE018156 

434039 \J32977 

424939 AK000069 

414539 BE379046 
404675 

401597 AA172106 



ESTs 



gb:RC1-BN0056^200-021-e1 1 BN00S6 Homo sapie 
ubiquinol-cytochrome c reductase (6.4kD) subunit

predicted exon 

ARP3 (actin-related protein 3, yeast) homolog

cadherin 13, H-cadherin (heart)

carcinoembryonic antigen-related cell adhesion molecul

Homo sapiens mRNA; cDNA DKFZp586A0424 (from

voltage-dependent anion channel 1

tumor necrosis factor (ligand) superfamily, member 7

gb:601269258F1 NIH_MGC_8 Homo sapiens cDNA c



Hs.79386 
Hs.7258 
Hs.196491 
Hs.15259 



Hs.5321 

Hs.63984 
Hs.22052g 
Hs.301241 
Hs.149155 
Hs.99899 



Hs.102773 

Hs.182625 VAMP (vesicle-associated membrane protein)-associate
Hs.153244 ESTs 

predicted exon

Hs.6179S transcription factor AP-2 gamma (activating enhancer-b

Hs.151641 glycoprotein A repetitions predominant

Hs.25035 chloride intracellular channel 4 

Hs.104640 HIV-1 inducer of short transcripts binding protein 

Hs.3416 adipose differentiation-related protein 

predicted exon 
Hs.155145 ESTs 

predicted exon 
Hs.182470 PTD010 protein 

Hs.178470 Homo sapiens cDNA: FLJ22662 fis, clone HSI08080
Hs.118625 hexokinase 1
Hs.221994 ESTs 

Hs.75725 transgelin 2 

Hs.2575 calpain 1, (mu/l) large subunit 

Hs.3712 ubiquinol-cytochrome c reductase, Rieske iron-sulfur po

Hs.153881 Homo sapiens NY-REN-62 antigen mRNA, partial cds

gb:601236646F1 NIH_MGC_44 Homo sapiens cDNA

predicted exon
Hs.110950 Rag C protein

predicted exon

gb:za62b02.r1 Soares fetal liver spleen 1NFLS Homo sa
412026 AI827461

414276 BE297862 

444065 AW449415 

447981 R53772 

410677 NM_003278 
400982 

452933 AW391423 

407233 X16354 

430127 AA219498 

448218 AI188489 

413511 AI627178 

459511 AI142379 

410668 BE379794 

458662 AI823410 

451219 AAC54209 

448939 BE267795 

400800 Y10262 

446342 BE298665 

421177 AW070211 

433848 AF095719 

448497 BE613269 

415279 F04237 

419323 AI092379 



437579 



M_014214 



412923 AA179922 

447980 AI703397 

419118 AA234223 

421224 AW402154 

414890 BE281C9S 

447330 BE279949 
405610 

447604 AW089933 

445677 H96577 

456088 BE177320 

417120 N79687 
405194

410687 U24389 

421888 AA299780 

420459 AF016045 

416323 N72630 

446292 AF081497 

416274 AW160404 

430028 BE564110

438450 AI050866 
400215 

430014 H59354 

453682 AW854339 
406867 

469170 AI905518 

407944 R34008 

415748 D90086 

423287 H38340 

450944 AA554989 

432906 BE265489 
400104 

449019 AI949095 

406897 M57417 



gb:601174780F1 NIH_MGC_17 Homo sapiens cDNA 
Hs.10260 Homo sapiens cDNA FLJ11341 fis, clone PLACE1010
Hs.8929 hypothetical protein FLJ11362
Hs.65424 tetranectin (plasminogen-binding protein)



Hs.233952 proteasome (prosome, macropain) subunit alpha type, 7
gb:qd09b12.x1 Soares_placenta_8to9weeks_2NbHP8to

Hs.75412 Arginine-rich protein

gb:qg64c01.r1 Soares_testis_NHT Homo sapiens cDNA
Hs.65403 hypothetical protein
Hs.159149 karyopherin alpha 1 (importin alpha 5)



Hs.14846 
Hs.102415 
Hs.93764 
Hs.21893 
Hs.1447 
Hs.135275 
Hs.237356 
Hs.5753 
Hs.158287 
Hs.75056 
Hs.202355 ESTs
Hs.139204 
HS.12S812 

Hs!l8141 



Hs^74 
Hs.6838 
Hs.156148 
Hs.46616 

Hs.65436 
Hs.121036 
Hs.97905 
Hs.33981 . 



hypothetical protein FLJ10637
eyes absent (Drosophila) homolog 3
Homo sapiens mRNA; cDNA DKFZp564D016 (from c
Homo sapiens mRNA; cDNA DKFZp586N0121 (from
carboxypeptidase A3

ESTs, Weakly similar to glycerol 3-phosphate permease
glial fibrillary acidic protein
ESTs

stromal cell-derived factor 1



ovo (Drosophila) homolog-like 1

Homo sapiens genomic DNA, chromosome 21q, section 

Rh type C glycoprotein 

guanine nucleotide binding protein 10 

NADH dehydrogenase (ubiquinone) 1 beta subcomplex

nodal, mouse, homolog 

predicted exon 

hypothetical protein FLJ11937
predicted exon 

gb:RC-BT091-210199-098 BT091 Homo sapiens cDNA
desmocollin 2

pyruvate dehydrogenase (lipoamide) beta
gb:yp70h07.r1 Soares adult brain N2b4HB55Y Homo s
sudD (suppressor of bimD6, Aspergillus nidulans) homo
lethal giant larvae (Drosophila) homolog 2



430223 NM_002514

406686 11118728 

444747 AW450407 

417883 R22519 



444409 AI7g2140 

428578 BE391797 

433417 AA587773 

426372 BE304680 
402131 

450545 AW135582 

434162 AI221214 



Hs.292816 
Hs.61753 
Hs.279898 



Homo sapiens cDNA: FLJ23165 fis, clone LNG09846



Hs.257291 
Hs.23398 
Hs.236494 
Hs.204605 
Hs.212640 



RAB10, member RAS oncogene family 



hypothetical protein
ESTs 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 21



cystatin B (stefin B)
leucine rich repeat (in FLII) interacting protein 1
436236 T03890

457439 AW41(M08 

448667 Z78394 

440606 Z40094 

426724 AA383623 
403359 

442826 AI018777

411503 AW190338 

414540 BE379050 

421595 AB014520 

438802 AA825976 

400491 H25530 

418994 AA296520 

426383 BE537330 

418408 AA219321 

416186 W87575 

416908 AA333990 

453867 AL080236 

439706 AW872527 

441619 NM_014056

417198 F11533 

433662 W07162 

463986 M13232 

457123 AA770021 

433864 AA931550 

409866 AW602208 

448175 BE296174 
406277 

451957 AI796320

408802 AL048269 
401757 

444751 AI207406

408647 AW245831 

418870 AF147204 

436913 AA789074 

451743 AW074266 

421853 AL117472 

407926 AW966382 

413973 BE279858 

439078 AF085936 
401913 

435138 BE314734 
405311 

413127 BE066S29 

430793 M83181 

434445 AI349306 

413166 AI764416 

431971 BE274907 
401167 

454163 AW175997 

403306 NM_006825 

410627 AA181339 

450796 NM_001988 

442199 BE277633 
402699 

428143 BE379836 

437592 NtL0038Sl 

433598 AI762836 



Hs.157208 ESTs, Highly similar to Anc homeoprotein [M.musculu

Hs.271167 L-pipecolic acid oxidase

Hs.4896 Homo sapiens cDNA: FLJ22046 fis, clone HEP09276

Hs.185698 ESTs 

Hs.293616 ESTs 

predicted exon 
Hs.131241 ESTs 

Hs.28029 purinergic receptor P2X, ligand-gated ion channel, 4
gb:601236655F1 NIH_MGC_44 Homo sapiens cDNA
Hs.105958 Homo sapiens cDNA: FLJ22735 fis, clone HUV00180
Hs.136954 ESTs

Hs.60868 solute carrier family 22 (organic cation transporter), me
Hs.89546 selectin E (endothelial adhesion molecule 1)

gb:601054570F1 NIH_MGC_10 Homo sapiens cDNA 
Hs.173294 ESTs 



ig, mitochondrial FCcomp 



Hs.59761 ESTs 

Hs.7917 DKFZP564K247 protein 

Hs.81634 ATP synthase, H+ transpor

Hs.150826 CATX-8 protein

Hs.36989 coagulation factor VII (serum prothrombin conversion a

Hs.16332 ESTs 

Hs.192785 ESTs 



l-HF-BROp-aju-e-09-O-Ul.rl NIH_MGC_52 Horn 
Hs.225160 Homo sapiens cDNA FLJ13102 fis, clone NT2RP3002 
predicted exon 

Hs.10299 Homo sapiens cDNA FLJ13545 fis, clone PLACE1006
Hs.288544 Homo sapiens cDNA: FLJ20882 fis, clone ADKA0320



Hs.11866 



hypothetical protein PRO1197

gb:2822937.5prime NIH_MGC_7 Homo sapiens cDNA

chemokine (C-X-C motif), receptor 4 (fusin)

ESTs

ESTs, Weakly similar to HuEMAP [H.sapiens]
ESTs

DKFZP586P1422 protein



Hs.59771 ESTs 



Hs.83484 
Hs.247940 
Hs.11782 
Hs.260024 
Hs.77385 



Hs.74368 
Hs.929 
Hs.25482 
Hs.286027 

Hs.167106 
Hs.5710 - 
Hs.271433 



gb:601152976F1 NIH_MGC_19 Homo sapiens cDNA
predicted exon 

SRY (sex determining region Y)-box 4 



Hs.164166 ESTs 



ESTs 

Cdc42 effector protein 3 

myosin, light polypeptide 6, alkali, smooth muscle and n 
predicted exon 

gb:QV0-BT0078-190899-005-EC2 BT0078 Homo sapi 
transmembrane protein (63kD), endoplasmic reticulum/ 
myosin, heavy polypeptide 7, cardiac muscle, beta



proteasome (prosome, macropain) subunit alpha type, 3
cellular repressor of E1A-stimulated genes
ESTs, Moderately similar to ALU2_HUMAN ALU SU
predicted exon 



458698 AW452189 

422048 NM_012446 

413460 R61610 
401575 

431822 AA516049 

427275 AA400269

417059 AA442192 
400151 

417190 NM_001369

443657 AI129066 



Hs.257528 
Hs.288126 
Hs.21527 



ESTs, Weakly similar to KIAA0918 protein [H.sapiens

gb:ng65d01.s1 NCI_CGAP_Lip2 Homo sapiens cDNA
ESTs

cytochrome c oxidase subunit VIII
predicted exon

2,4-dienoyl CoA reductase 1, mitochondrial



Hs.293334 ESTs 



416328 H48389 

427174 AA39884a 

426148 AI751071 

452S44 AWB5188B 



Hs.268886 
Hs.97541 
Hs.167135 



cDNA FU1(J728 fis, clone NT2RP3001 
gb-.Q\/I)-Cr0225-131099^4Hl05 CT022S Homo sapie 
predicted exon 



428362 M426555

425349 AA425234 

422440 NM_004812 

410962 BE273749 

411796 AA807197 

458954 AW379075 

408896 AI610447

457024 AA397546 

414591 AI88B490 

437846 AA773866 



405256 

452416 AA026115 

440684 AI253123 

445603 H08345 

436306 AAB05939 

434867 AF159442 
404727 

407317 AI204033 
405580 

437898 W81260 

448781 AW243419 

457297 AW968188 
405545 

431662 AI8B4334 

440703 AL137663

439848 AW979249 

418149 AAS11473 

439332 AW842747 



406684 X16354 

421064 AI245432 

441249 AA971585 

457624 AA809159 

407395 AF005082 

459006 AW298631 

436827 H72187 

418307 U70867 

456035 N549S6 

457867 AA045767 

440401 AI126341
400126 

414931 AK000342 

406719 AI832962 

439675 W95357 

456068 N94587 

441926 AI016051 

428423 AU076517 

433S18 BES61958 

420674 NMJOOOSS 

422160 AWS82898 

412408 D51103 
400964 

434360 AW015415 

427977 AW630727 

450339 AI693281 

424059 AW451266 

414626 BE4105B9 
401991 

419741 NM_007019 

457952 U25750 



447306 AI373163 

424966 AU077312 

422739 H20106 

432504 AL121015 

423804 AW403448 

404633 AI924294 

441624 AF220191 

425751 T19239 

452976 R44214 

414642 AA150350 

437452 AL390127 

417426 NM_002291 

414774 X02419 

424631 AA688C21 

413967 AW204431 



Hs.169333 ESTs 



ribQse5-( 

aldo-keto reductase family 1, member B11 (al
FK506-binding protein 1A (12kD)



Hs.244569 ESTs 



17kD fetal brain protein

ESTs, Moderately similar to NAC-1 protein [R.norvegic

predicted exon 

ESTs 

ESTs, Highly similar to NEST_HUMAN NEST1 [H.sap



phospholipid scramblase 3
predicted exon

ESTs, Weakly similar to ALU5_HUMAN ALU SUBFA



phosphodiesterase 2A, cGMP-stimulated
carcinoembryonic antigen-related cell adhesion molecul

ESTs 

tumor necrosis factor, alpha-induced protein 2

Homo sapiens cDNA FLJ13544 fis, clone PLACE1006

gb:Homo sapiens skin-specific protein (xp33) mRNA, p

hypothetical protein FLJ20353

guanine nucleotide binding protein (G protein), gamma

Rho GDP dissociation inhibitor (GDI) beta

solute carrier family 21 (prostaglandin transporter), mem



Hs.143887 ESTs 

predicted exon 

Hs.77646 Homo sapiens mRNA; cDNA DKFZp761M0223 (from
Hs.169476 glyceraldehyde-3-phosphate dehydrogenase
Hs.138860 Rho GTPase activating protein 1



Hs.285823 immunoglobulin heavy constant mu 
Hs.1327 butyrylcholinesterase

gb:ia07e04.y1 Human Pancreatic Islets Homo sapiens c
Hs.73851 ATP synthase, H+ transporting, mitochondrial F0 comp

predicted exon 
Hs.127780 ESTs 
Hs.181307 H3 histone, family 3A
Hs.54547 ESTs 
Hs.107418 ESTs 

gb:601303308F1 NIH_MGC_21 Homo sapiens cDNA
predicted exon
Hs.93002 ubiquitin carrier protein E2-C
Hs.210783 Human chromosome 17q21 mRNA clone 1046:1-1
Hs.118634 ATP-binding cassette, sub-family B (MDR/TAP), mem
lipocalin 2 (oncogene 24p3)



Hs.153985 
Hs.119591 
Hs.277704 
Hs.1706 
Hs.173259 
Hs.179666 
Hs.1940 
Hs.101189 

Hs.7104 
Hs.82124 I 
Hs.77274 I 
HS.179B08 
Hs.117853 



ESTs 

solute carrier family 7 (cationic amino acid transporter,
adaptor-related protein complex 2, sigma 1 subunit
oxygen regulated protein (150kD)
interferon-stimulated transcription factor 3, gamma (48k
uncharacterized bone marrow protein BM033
uncharacterized hypothalamus protein HSMNP1
400174

431837 T79326 
401628 

418374 AJ011916 

429297 XB2494 
403508 

432638 AI017717 

407382 AA503620 

411492 T46848 

420185 AL044056

409545 BE296182 

426662 AA879474 

424247 X14008 

443062 N77999 

422447 AA310711 

421574 AJ000152 

435302 AI076259 

414527 BE241739 

441436 AW137772 



427889 AI400968 

441114 AA917456 

451831 NM_001674
405600 

445981 AI652743 

432839 AA679465 
405208 

435025 T08990 

413976 BE29S452 

423515 AA327017 



predicted exon 

Hs.298262 ESTs, Weakly similar to dJ88J8.1 [H.sapiens]

predicted exon
Hs.84359 hypothetical protein
Hs.198862 fibulin 2

predicted exon
Hs.126525 chromosome 21 open reading frame 15

gbJie49b03.s1 NCLCGAP_Co3 Homo sapiens cDNA 
Hs.70337 immunoglobulin superfamily, member 4 
Hs.158047 ESTs

gb:601177324F1 NIH_MGC_17 Homo sapiens cDNA
Hs.122710 ESTs

Hs.234734 lysozyme (renal amyloidosis)

Hs.8963 Homo sapiens mRNA full length insert cDNA clone EU

Hs.124340 ESTs

Hs.105924 defensin, beta 2



Hs.185980 ESTs 

gb:CM2-CT01 28-23089W)05^02 CTD128 Homo sapie 
gb:601281335F1 NIH_MGC_39 Homo sapiens cDNA 

Hs.181046 dual specificity phosphatase 3 (vaccinia virus phosphat

Hs.126600 ESTs

Hs.460 activating transcription factor 3

predicted exon
Hs.197497 ESTs
Hs.287332 ESTs

predicted exon

Hs.4742 anchor attachment protein 1 (Gaa1p, yeast) homolog
Hs.75655 procollagen-proline, 2-oxoglutarate 4-dioxygenase (pro
Hs.162204 ESTs

Hs.29106 mitogen-activated protein kinase phosphatase x

gb:EST23S29 Adipose tissue, brown Homo sapiens cD 
gb:RC1-HT0370-120100O12-c09 HT0370 Homo sapie 

Hs.204723 ESTs 



446598 AW250546 

434493 AA635305 

429582 AI569068 
403796 
405028 

426597 AA382250 

437308 AA749417 

447384 AI377221 

429060 AW139155 



432999 BE294029 

407663 NM_016429 

446527 AI973016 

413605 BE1S2644 

427286 AW732802 



gb:2821774.5prime NIH_MGC_7 Homo sapiens cDNA
Hs.121674 ESTs
Hs.22247 ESTs 



Hs.145601 ESTs 

Hs.292353 ESTs 

Hs.40528 ESTs 

Hs.194995 hypothetical protein DKFZp434Oa320

Hs.291427 ESTs

Hs.85539 ATP synthase, H+ transporting, mitochondrial F0 comp

Hs.279903 Ras homolog enriched in brain 2

Hs.37482 COPZ2 for nonclathrin coat protein zeta-COP

Hs.15725 hypothetical protein SBBI48

gb:CM1-HT0329-250200-128-f09 HT0329 Homo sapie
Hs.2132 - 



(th factor receptor path 



457960 AA771881 



412652 AI801777 

428373 AI751656 

416138 C18946 

425134 BE278288 

411028 AW813703 

417438 Z43989 

417534 NM_004998 

427767 AI879283 



Hs.298149 I 

Hs.31861 
Hs.258119 
Hs.196963 
Hs.42644 
Hs.113261 
Hs.6774 



435060 AI422719 

432412 AI470549 

407491 S82769 

418960 NM_004494



409415 AA579258 



hypothetical protein PRO3073 



Hs.82141 
Hs.82251 
Hs.180714

Hs.459
Hs.146246
Hsi33349 
Hs.162201 

HS.8952S 
Hs.168541 
Hs.137840 

Hs.157938 
HS.1C949 
Hs.6083 



Homo sapiens skin-specific protein (xp33) mRNA, part
ESTs 

poliovirus receptor-related 2 (herpesvirus entry mediato 
myeloid leukemia factor 2 

Lutheran blood group (Auberger b
gb:RC3-ST01 97-1 301 00-014-ti09 ST0197 He 
"""12 mRNA sequence 



myosin IC 

cytochrome c oxidase subunit VIa polypeptide 1
gb:nn49d09.s1 NCLCGAP.Ki'dG Homo sapiens cDNA
succinate dehydrogenase complex, subunit A, flavoprot
ESTs

ESTs, Weakly similar to fork head like protein [H.sapie



gb:GABAA receptor gamma 3 s
hepatoma-derived growth factor (high-mobility group p
Homo sapiens mRNA full length insert cDNA clone EU
ESTs, Moderately similar to SIX1_HUMAN HOMEOB
predicted exon

hypothetical protein FLJ20595

Homo sapiens cDNA FLJ14162 fis, clone NT2RM4fl02

Homo sapiens cDNA: FLJ21028 fis, clone CAE07155



408546 W49512 

460008 H62970 

430998 AF128847

438901 AF085834 

440500 AA972165 

413101 BE065215 

447452 BE618258 

412446 AI768015

418975 T75496 

464961 AW847807 
401072 
401204

433626 AF078859 

418047 R37633 

443380 AI792478 

427424 AA402463 

433412 AV653729 

422599 BE387202 

436656 R93409 

413745 AW247252 

418874 T60872 

462574 AF127481 

400332 S66407 
402421 

427138 N77624 

432038 AA524746 

423711 AF05gig4 



Hs.46348 bradykinin receptor B1 
Hs.36588 WAP four-disulfide core domain 1 
Hs.204033 
Hs.29035 ESTs 
Hs.150308 ESTs 

gb:RC1-BT0314-310300-C15«1 BTOSU Homo sapie 
Hs.102480 ESTs 
Hs.92127 ESTs 
Hs.296980 ESTs 

gb:IL3<:T0213-190200-C4O-E12 CTC213 Homosapien 



Hs.86347 hypothetical protein 
Hs.4847 ESTs 
Hs.135377 ESTs 

Hs.8135 CGI-44 protein; sulfide dehydrogenase like (yeast)

Hs.118533

Hs.120759 ESTs 
Hs.75514 nucleosii 

gb:yb72h11.s1 Stratagene ovary (937217) Homo sapien
Hs.35093 lymphoid blast crisis oncogene
Hs.248032 aT4

predicted exon
Hs.173717 phosphatidic acid phosphatase type 2B



Hs.131953 v-mafmu 



436661 AI125270 

437836 BE269291 

437329 AA811977 

445830 H104S1 

406824 AW515951 

421271 AW170057 
400256 

414028 AA782576 

466728 AL120077

417707 AL036786 

438713 H16902 

460306 AL080080 

438898 AI819363 
403273 

414606 BE390440 

401283 

403703 

416969 AI815443 

442400 AW381148 

447563 BE536116 

419754 HS2299 

408204 AA4S4S01 



400642 

431582 F07136 

442724 AA366626 

417861 AA334651 
402948 

411004 AW813242 

435478 AA682622 

447966 BE544271 

433692 NM_004642 

420865 N73241 



Hs.128069 
Hs^9245B 
Hsi91761 



ESTs, Weakly similar to similar to collagen [C.elegans]



400807 



402986 BE244588 

451375 AI792066

463686 AA248089 

433090 AI720050 

426063 AF046024 

412802 U41518 

409738 BE222975 

428246 AF151048

412582 BE270631 
406207 
400931 

410709 AL122109 

428438 N|yL00igS5 

446918 AL135125 

417821 BE245149 



Hs.4944 

Hs.122967 

Hs.82425 



predicted exon 

Homo sapiens cDNA FLJ12783 fis, clone NT2RP2001

kelch (Drosophila)-like 2 (Mayven)

actin related protein 2/3 complex, subunit 5 (16 kD)

ESTs 

DKFZP564E1962 protein 



Hs.283404 organic i 

Hs.3593 ESTs

Hs.160983 ESTs
Hs.75243 

Hs.43666 protein tyrosine phosphatase type IVA, member 3 

Hs.250891 ESTs 

HS.2S2SS7 pituitary tumor-transforming 1 

gb:PM2-HT0451-0g0100-002-f04 HTD451 

Hs.272688 hypothetical protein FLJ20397



Hs.261828

Hs.169604 cysteinyl-tRNA synthetase
Hs.82767 sperm specific antigen 2
predicted exon 

gb:lk(R3-ST0191-020200-207-g10 ST0191 Homo sapie 
gb:zj20ID9.s1 SoaresJelalJver.spleenJNFLS.SI Ho 
Hs.288390 Homo sapiens cDNA: FLJ22795 fis, clone KAIA2543
Hs.3436 deleted in oral cancer (mouse, homolog) 1
Hs.100001 solute carrier family 17 (sodium phosphate), member 1
Hs.28774 ESTs

predicted exon

Hs.93841 potassium large conductance calcium-activated channel
Hs.99923 lectin, galactoside-binding, soluble, 7 (galectin 7)
Hs.6466 chaperonin containing TCP1, subunit 2 (beta)
Hs.283902 Homo sapiens BAC clone RP11-481J13 from 2
Hs.60841 ESTs, Weakly similar to tuftelin [M.musculus]
Hs.145362 immortalization-upregulated protein
Hs.164320 ubiquitin-activating enzyme E1C (homologous to yeast
Hs.74602 aquaporin 1 (channel-forming integral protein, 28kD)



Hs.65735 Homo sapiens mRNA;cI3NADKFZp434ll«1827 (horn 

Hs.2271 endothelin 1

Hs.13913 KIAA1577 protein

Hs.82643 protein tyrosine kinase 9
414511 M148725

451545 AF051782 

441899 AI372688 

425811 AL039104 

411014 AW816072 

451400 BE16(M79 

459247 N46243 

441633 AW958544 

427466 AA623543 

406893 M22406 
406268 
403348 
400970 

414045 NM_002951 



445834 AI91329a 

422525 AA758797 

425383 D83407 

464590 AW809762 

411529 AA430348 

425397 J04038 
403234 

427267 AI201185
400203

449296 AL137257 

406704 M21665 

423083 AA321774 

422112 BES4024C 

413282 BE0781Sg 

463702 AA037637 
403065 

440633 AI140686 

466994 AA383623 

468260 R41782 

462388 BE019696 

422278 AF072873 

441989 AA306207 

418768 AW969311 

406646 M33600 

433063 BE301909 

414194 BE175494 

462321 AW844493 

449713 AW027025 

468827 AWg70785 

414092 Z14244 

441730 AI243276 

420701 N42919 
403642 

408987 H85615 

446712 AW204789 
403286 

434439 AI022360
404067 

455694 BEa67300 
403287 

434633 AI189587 

408199 AA132637 

420080 M94066 

408862 AW291435 
403786 

416839 H94900 

434386 AA631946 

446845 AI343645 

402520 

436098 R20597 

438974 AF089816 

447751 AA339541 

451310 AW250651 

435961 BE293127 

452937 BE410390 
404860 

438360 H74149 

436508 AW604381 

430486 BE06210g 

407824 AA147884 
406388 

430204 AA618335 

457560 AI801934 

429621 BE04B708 

429758 AW137722 

441473 AA934995 



Hs.110373 

Hs.7578 



karyopherin alpha 2 (RAG cohort 1, importin alpha 1)
gb:MR3-ST02204)70100J)21-h07ST0220HoraosapIe 
gb:QV1-HT0413-2102004)81-g05 HT0413 Homosapl 
ESTs 



ESTs 

cellular retinoic acid-binding protein 1

gb:Human intestinal mucin mRNA, partial cds, clone SM

predicted exon
predicted exon
predicted exon
ribophorin II
EST 



Hs.192807 ESTs 
Hs.156007 
Hs.222056 
Hs.268337 



Hs.23458 
Hs.929 
Hs.10941 
Hs.111783 



Hs.42128 ESTs 

predidec 
Hs.263320 ESTs 



Down syndrome critical region gene 1-like 1
Homo sapiens cDNA FLJ11572 fis, clone HEMBA100
Homo sapiens cDNA FLJ12927 fis, clone NT2RP2004

topoisomerase (DNA) II alpha (170kD)
predicted exon
ESTs

predicted exon

Homo sapiens mRNA; cDNA DKFZp434C1613 (from
myosin, heavy polypeptide 7, cardiac muscle, beta
ESTs, Weakly similar to IPP1_HUMAN PROTEIN PH
Lsm1 protein

gfa:Clbl0-BT0S1S-14O200-17S^6 BT061SHomosapie 



Hs.22279 ESTs 



Hs.29287
Hs.114218 
Hs.286241 
Hs.87019 



retinoblastoma-binding protein 8
frizzled (Drosophila) homolog 6
Homo sapiens cDNA: FLJ22698 fis, cl(



Homo sapiens LENG8 mRNA, variant C, partial sequen



Hs.178470 
Hs.75752
Hs.149017

Hs.68630 ESTs, Weakly similar to AC007228 1 R31666 2 [H.sap

predicted exon

gb:/t03f11.r1 Soares retina N2b5HR Homo sapiens cD
Hs.209828 ESTs

predicted exon
Hs.190583 ESTs

g^Pl5QlmC0Vl61299-OO1-h BT0349 Homo sapie 

predicted exon 
Hs.120915 ESTs 
Hs.15396 ESTs 

Hs.94925 dihydroorotate dehydrogenase
Hs.264961 ESTs 

predicted exon 
Hs.17882 ESTs 
Hs.259580 ESTs 
Hs.156108 ESTs 



Hs.9739 ESTs 



chromosome 19 open reading frame 3 
hypothetical protein FLJ22056
ESTs, Moderately similar to dJ447F3.3 [H.sapiens]



Hs.288193 
Hs.121121 
Hs.241551 
Hs.9812 

Hs.146137 
Hs.163909 
Hs.50949 
Hs.246804 



chloride channel, calcium activated, family member 2



ESTs, Weakly similar to R28830 1 [H.sapiens]

411724 AA770569

450453 AA009883 

419687 AI638859 

442162 AW294966 



435551 AF212365 
440069 BE617892 
432277 AI669790



Hs.71618 

Hs.50186 

Hs.227699 

Hs.160849 

Hs.5422 

Hs.82112 

Hs.76664 

Hs.119206

Hs.5470 

Hs.6896 

Hs.16ie2S 

HS.18222S 

Hs.72010 

HS.1S3684 

Hs.294092 



polymerase (RNA) II (DNA directed) polypeptide L (7.

ESTs

ESTs, Weakly similar to Yhr217cp [S.cerevisiae]
ESTs

glycoprotein M6B
interleukin 1 receptor, type I
CD151 antigen

insulin-like growth factor binding protein 7
IL-17B receptor

actin related protein 2/3 complex, subunit 3 (21 kD)
ESTs

RNA binding motif protein 3
ESTs

frizzled-related protein



433096 
400344 
417501 
400449 
453801 ; 
436849 I 
454181 ; 
414807 ; 
406326 
421921 i 
416700 J 
458857 i 
405501 
416601 
426600 
425590 
423161 
426420 
414428 
404601 
403861 
443363 
406655 
435372 ; 
413164 
443021 , 
412975 ■ 



AU076803 

NM_012368 
AL041219 



NM_003378 
AI954686 
AA422028 



440919 
423725 
401747 
454209 
417661 
426499 
404240 



AW134832 
D56C51 
AW291274 
AJ403108 

AW179083

C14937

AA307634 



439739 AI199391 

437974 T74445 

427490 Z95152 

443482 AW188093 

411420 BE3906S2 

435196 F35675 

417022 NM_014737 

413531 AL036958 

428981 BE313077 

421598 AW630942 

443907 AU076484 

406754 AA477223 



Hs.282976 
Hs.258574 . 
Hs.82222 



olfactory receptor, family 2, subfamily C, member 1

sema domain, immunoglobulin domain (Ig), short basic

predicted exon

mRNA for FLJ00023 protein

ESTs, Weakly similar to CLDE_HUMAN CLAUDIN-

gb:CM4-CT0129-190899-007-eCg CT0129 Homosapte

hydroxyprostaglandin dehydrogenase 15-(NAD)



translocase of inner mitochondrial membrane 10 (yeast)




Hs.20205 
Hs.171014 
Hs.158321 



Hs.8904 
Hs.75106 
Hs.74304 



VGF nerve growth factor inducible

beaded filament structural protein 2, phakinin

gb:zv26g06.r1 Soares_NhHMPu_S1 Homo sapiens cDN

VAMP (vesicle-associated membrane protein)-associate
predicted exon
predicted exon

6-pyruvoyltetrahydropterin synthase

major histocompatibility complex, class I, A

ESTs, Highly similar to CIKG_HUMAN VOLTAGE-G

gb:RC0-BT0362-021299-031-b06 BT0362 Homo sapie

Ig superfamily protein

clusterin (complement lysis inhibitor, SP-40,40, sulfated



Hs.47094 ESTs 

Hs.136348 osteoblast specific factor 2 (fasciclin I-like)
gb:Human dynactin mRNA, partial cds.



Hs.100623 
Hs.15476 
Hs.180372 

Hs.112689
Hs.65771
Hs.82962
Hs.245295 
Hs.78888 



gb:nh26e06.s1 Na_CGAP_(W Homo sapiens cDNAc 
i19,BACCIT-HSPCJ04F 



thymidylate synthetase 
ESTs 

diazepam binding inhibitor (GABA receptor modulator 

m.asii'a ESTs 
Hs.132127 hypothetical protein LOC57822 
predicted exon 

gb:MR4-ST0065-2708994)C&A07 STC06S Homo sapi 
Hs.15454 Homo sapiens cDNA: FLJ21351 fis, clone COL02762
Hs.11169 Gene 33/Mig-6

predicted exon 

Hs.6650 vacuolar protein sorting 45B (yeast homolog) 
predicted exon 



ESTs 



Hs.301469 
Hs.124454
Hs.5957
Hs.178695
Hs.250385 ESTs 



Hs.188128 
Hs.80905 
Hs.75416 



Hs.93135 ESTs
Hs.106061 
Hs.9963 
Hs.75922 



gb:601286820F1 NIH_MGC_44 Homo sapiens cDNA 
ESTs, Moderately similar to ALUB_HUMAN ALU
Ras association (RalGDS/AF-6) domain family 2
•■dproteln2



RD RNA-binding protein
TYRO protein tyrosine kin
brain protein 13
Hs.112704 ESTs

Hs.292911 
Hs.110524
HS.2S0S70 
Hs.236816
Hs.248144 
Hs.278909 
Hs.166676 
Hs.41135 



translocase of inner mitochondrial membrane 10 (yeast)



424126 AA335635 

408473 BE259039 
401962 

447326 AW002252 

469053 AI807052
403362 

402061 H83363 

433785 BE044593 
405423 

429259 AM20450 

444071 AI6278Ce 

410512 AA085603 

440376 AI024452 

457353 X55533 

432749 NM_014438 

415602 F12920 

407891 AA486620 

455910 Z43712 

425716 NM_006379 

444246 H93281 



446625 AI333070 Hs.156141 ESTs

423334 AK000906 Hs.127273 hypothetical protein FLJ10044

423103 AA322029 gb:EST24685 Cerebellum II Homo sapiens cDNA 5' en

443549 T89603 Hs.16601 ESTs

419299 AI311085 Hs.62406 Homo sapiens cDNA: FLJ22573 fis, clone HSI02387

411942 AW877015 gb:QV2-PT0010-250300mf12PT0010 Homo sapien

442440 BE464435 Hs.146180 ESTs, Weakly similar to non-receptor protein tyrosine k

454574 AW809109 gb:MR4-ST0117-070100-027-a04 ST0117 Homo sapie

454377 AA076811 gb:7B03C12 Chromosome 7 Fetal Brain cDNA Library

422365 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic subunit of DNA p

421733 AL119671 Hs.1420 fibroblast growth factor receptor 3 (achondroplasia, tha 

Hs.4775 junctophilin 3



endotrxin-2 

gb:HSC1JA121 normalized infant brain cDNA Homo s
sema domain, immunoglobulin domain (Ig), short basic
hypothetical protein FLJ20417



401373 
402292 

444118 AA458542 

408310 AW179023 

411236 AW833752 

431405 AI470895 

441408 AI733249 

453994 BE180964 

444618 AI1B0278 
402407 
404270 

409103 AF251237 

415198 AW009480 

430771 BE387244 

432636 AA340864 

433504 NM_014874 

415606 W70022 

401401 BE047878 

420758 AW297536 

457520 AA553495 

432323 AK001409 
404750 

450645 AL117441

445160 AI299144 

418461 BE242781 
401809

458121 842416 

435106 AA100847 

448398 AW444665 

428145 BE243327 

445302 AK001537 

407352 H47860 

413190 AA151802 

436371 AI821912 
400965 

433427 AI816449 

427504 AA776743 

425759 AI590401 

423792 AW135866 



437453 AI761350

409275 AW372097 

449628 AI697676 

421043 BE379455 

442344 AI022925 

448744 AL135424 

416062 AA724811 

414500 W24087 



Hs.10326 coatomer protein complex, subunit epsilon

gb:PM3-ST0036-170899-001-e08 ST0036 Homo sapie
gb:QV4-TT0008-130100-077-b07 TT0008 Homo sapie

Hs.252574 ribosomal protein L10a

Hs.126897 ESTs

Hs.165590 ribosomal protein S13

Hs.146884 ESTs 



Hs.112208 XAGE-1 protein

Hs.943 natural killer cell transcript 4

Hs.2664 flavin containing monooxygenase 4

Hs.278562 claudin 7

Hs.3363 KIAA0214 gene product

gb:zd51e10.r1 Soares_fetal_heart_NbHH19W Homo sa

Hs.99093 Homo sapiens chromosome 19, cosmid R28379

Hs.33053 ESTs

Hs.162264 ESTs

Hs.274366 hypothetical protein FLJ10547

predicted exon 

Hs.25264 DKFZP434N126 protein 

Hs.150797 ESTs 

Hs.288037 Homo sapiens cDNA FLJ12999 fis, clone NT2RP3000

Hs.74647 Human T-cell receptor active alpha chain mRNA from

Hs.193380 ESTs, Highly similar to AF174600 1 F-box protein Fbx

Hs.170838 ESTs

Hs.182626 chromosome 22 open reading frame 5

Hs.12488 hypothetical protein FLJ10675

gb:yp76h12.r1 Soares fetal liver spleen 1NFLS Homo s

Hs.40368 adaptor-related protein complex 1, sigma 2 subunit 



i.113912 



predicted exon 



Hs.191589 ESTs 

Hs.21213 ESTs 

Hs.245854 ESTs 

Hs.34298 CD74 antigen (invariant polypeptide of major his 

Hs.277477 major histocompatibility complex, class I, C 

Hs.181391 hypothetical protein DKFZp761G2113 



Hs.301212 ESTs 

Hs.9469 phosphoi 

Hs.74427 p53-induced protein 

Hs.76285 DKFZP564B167 protein
427272 NM_001096
403964 

433217 AB040914 

427902 AI809202

449586 AI863918 

430826 U10061 

414195 BE263293 

416305 AU076628

411088 BE247593 

419407 AW410377 



417286 AA122237 
405515 

439319 AW016401 

419387 BE379356 

414015 AA3409B7 

447778 BE620592 



Hs.174140 ATP citrate lyase 

predicted exon 

Hs.278628 KIAA1481 protein

Hs.208343 ESTs, Weakly similar to cei

Hs.195078 ESTs

Hs.248019 POU domain, class 4, transcription factor 3

gb:601144881F2 NIH_MGC_19 Homo sapiens cDNA

Hs.79187 coxsackie virus and adenovirus receptor

Hs.145053 ESTs

Hs.41502 Homo sapiens cDNA: FLJ21276 fis, clone COL01829
HSJ5050 ■ ■ 

Hs.252720 ESTs 

Hs.81874 microsomal glutathione S-t

Hs.233476 ^'^f^''^™ 



457822 AA970001 

442424 AI342716 

418394 AF132B13 

413477 AI815825 
405277 

450192 AA263143 

442191 W95186 

429490 AI971131 

406744 AA554082 

425205 NM_005864 

414387 AL043148 

411311 AW864370 

433382 U90441 

414333 BE274897 
403747 

435542 AA687376 



Hs.186257
Hs.3622 



ESTs 

ESTs, Moderately simil 
Kruppel-like factor 5 (in
ESTs, Moderately simil
predicted exon
RAD51-interacting protein
endothelial PAS domain protein 1
ESTs, Weakly similar to alternatively spliced product u
hypothetical protein FLJ20030
receptor (calcitonin) activity modifying protein 2
ESTs

gb:PM4-SN0016-10050OO04-hC9 SN0015 Homosapie
procollagen-proline, 2-oxoglutarate 4-dioxygenase (pro
gb:601122959F1 NIH_MGC_20 Homo sapiens cDNA



Hs.269533 ESTs 



450506 NM_004460 
404763 

454533 AW811380 

440738 AI806594 

411800 N39342

441361 BE263308 

422033 AW245806 
405333 



438397 AA806478

435948 AA702676

450273 AW295454 

435969 W85773 

427031 AA397601

454505 AW801365 
403447 

433297 AV6S8S81 

443326 BE1S6494 

448283 AI340462 

458067 AA3936C3 

452359 BE16722g 

434098 AA6254g9 

450911 M011S86 

410342 R31350 

407082 Z470SS 

415271 X94232 

417413 AA197072 

408937 AA210734 

433459 AAS93498 



fibroblast activation protein, alpha 
predicted exon 

99-01 9-D05 ST01 43 Homo sapien 



Hs.123577 ESTs



Hs.11 

Hs.82932 
Hs.291990 
Hs.7416 
Hs.123206
Hs.114135
Hs.24743
Hs.191386
Hs.125147



TH1 drosophila homolog 

TERF1 (TRF1)-interacting nuclear factor 2

claudin 5 (transmembrane protein deleted in velocardiof

predicted exon



hypothetical protein FLJ20171



gb:IL5-UITO57-24030fr05&a01 UM0067 Horaosapl 



Hs.188478 
Hs.182979 
HSJ6752 
Hs.29206 



ribosomal protein L12

Homo sapiens cDNA: FLJ22834 fis, clone KAIA4314
Homo sapiens clone 24659 mRNA sequence
gb:af69g08.r1 Soares_NhHMPu_S1 Homo sapiens cDN
ESTs

Fc fragment of IgE, high affinity I, receptor for; gamma



Hs.78335 
Hs.86092 
H3.291386 1 



gb:nn27b05.s1 ICLCGAP_Gas1 Hi 

ESTs 

delta-6 fatty acid desaturase



410003 AA079487 

44C857 AA907808 

461072 AA013461 

418693 AI750878 

443624 BE616129 

422626 AA344932 



463317 NM_002277 



gb;zm97f08i1 Stratagene colon HT29 (937221) Homo 



Hs.117929 ESTs



Hs.66159
Hs.132929 
Hs.41696 



421486 AW408800 

428834 AW899713 

451419 R36309 

448413 AI746379 

424323 AA338791 

423943 AF163570 

439423 BE536573 

408246 N55659 

441579 AW468847 

420867 NM_014183 

453680 AU)79647 
400202 

410768 ARBBISS 

409932 AI376750 

425563 AR)84199 

440475 AI807671 

452767 AW014195 

410570 AI133096 

419600 AA448953 

419588 AI347205 

428976 NM_004672 

443928 A135026C 
403924 

419889 AA261600 

405023 AW408800 

426065 N32049 

453199 AI336266 

455132 AW857955 

442932 AA467211 

432055 AA4C1039 

444652 BE513613 

417935 R53697 

430050 AA430993 

446272 BE268912 

425996 W67330 

416964 D87467 

437418 AI478954 

447255 AI884908 
402203 

417611 AW993983 

426560 AA381661 

445153 AA026880 

445017 AI205493 

438558 AI222068 

442238 AW135374

443195 BE148235 

442609 AL020996 

415591 AA091976 
403674 

430514 AA318501 

411696 AW857404 

434550 R13062 

422527 BE336857 

414364 D38521 

409119 AA531133 

425540 U34051 

436044 BE247571 
401657 

449763 AI822112 

409601 AF237621 

44S636 AI6S6608 



Hs.147099 ESTs

Hs.216381 Homo sapiens Clone

Hs.43946 L13 protein

Hs.127194 ESTs

Hs.100002 HSPC162 protein

Hs.14485 ESTs 



Hs.66187 
Hs.57600 
Hs.299837 
Hs.128343 



Homo sapiens clone 23700 mRNA sequence 



ESTs, Weakly similar to unknown [S.cerevisiae]
ATP synthase, H+ transporting, mitochondrial F1F0, su
NEU1 protein

Human clone 23614 mRNA sequence
mitogen-activated protein kinase kinase kinase 6
Homo sapiens cDNA FLJ11743 fis, clone HEMBA100

gb:2s10cl12jf'NCI_CGAP_GCB1 Homo sapiens cDNA
hypothetical protein DKFZp762E1312
gb:yw96g08.s1 Soares_placenta_8to9weeks_2NbHP8to
Homo sapiens PRO0412 mRNA, complete cds
gb.'PM0^T0325-1S1299J02'A12 CT032SHomosapi 



Hs.81256 S100 calcium-binding protein A4 (calcium protein, calv



Hs.119878
Hs.25252
Hs.176860
Hs.123571
Hs.270949
Hs.193063



gb:RC1-BN0035-130400413-a04 BN003S Homosapie 
ESTs 

Homo sapiens cDNA FLJ13603 fis, clone PLACE1010



Hs.3518 
Hs.79387



predicted ex 

megakaryocyte-enhanced gene transcript 1 protein 

gb:CM3-CT031 3-291 1 99-046<1 1 CT031 3 Homo saple 

Homo sapiens clone 24877 mRNA sequence 

transforming growth factor, beta-induced, 68kD 

KIAA0077 protein 

G protein-coupled receptor 44

ESTs, Highly similar to CD58_HUMAN CYaiN-DE



Hs.118241 
Hs.80828 
Hs.281328 
Hs.167047 
Hs.241507
Hsi24277 



lal protein 86 



411651 AW855392 
404097 

447252 R90916 

430024 AI308780 

412828 AL133396 

444558 AW181975 

420869 X58964 

448812 H30776 

431777 AA570296 

422007 AI739435 
403051 
402427 

417408 F17211 

460698 AF151076 



Hs.227730 

Hs.165892 
Hs.123633 
Hs.22140 
Hs.105470 
Hs.39158 



ESTs 

gb:qk55g09.x1 NCLCGAP_& 
gb:ph1h3_19/1TV Outward Alu-primed hncDNA librar
DNA (cytosine-5-)-methyltransferase 1
predicted exon

gb:CI^3-CT0275-191 099-024^12 CT0275 Homo sapie

predicted exon

gb:yn01e10.r1 Soares adult brain N2b4HB55Y Homo s
integrin, alpha 6

prion protein (p27-30) (Creutzfeld-Jakob disease, Gerst
ESTs

regulatory factor X, 1 (influences HLA class II expressi

BM88 antigen

found in inflammatory zone 1



Human DNA sequence from clone RP11-243J16 on chr
421121 M459028

458488 AL040665 

417158 AW965223 

439318 AW837046 

428758 AA433988 

447672 AI631546 

434434 AA633516 

409994 D86864 



436659 AI217900 

422731 AL138411 

429294 AA095971 

432847 BE266941 

416977 AW130242 

406827 AA971409 

453758 U83527 

431314 AI732204 

423185 BE299590 

435086 AW975243 

447383 N24231 

456251 R13326 

456327 H68741 

450594 N31036 



418294 AF061739 

446646 BE167687

421100 AW351839 

455993 BE179085 

469376 BE251770 

454803 AWe60148 

445474 AI240014 

443198 AIC39813 

441557 AW452e47 

420206 M91463 

442202 BE272352 

416913 AW934714 

419355 AA428520 

452975 Ma5521 

432525 AI796096 

453718 AL119317

437270 R13087 

450954 AI904740 
402958 

445656 W2205C 

410684 AA088500 

437669 AI35ei05 

447869 AW139113 

458026 AI276406 

445614 AV660763 

454610 AW810224 

449303 AKC01495 

422105 AI929700 

444788 AI871122 

414057 AI815559 

408822 AW500715 

433379 AAS86368 

441552 AA937975 



430129 BE3C1708 
401465 

448913 AA194422 

410261 AF145713 

421199 BE244219 

450489 AI697990 

410186 AW602528 

447224 BE617125 
403010 
404881 

445572 AI243445 

419440 AB020689 

443406 AI056238 

457901 AW207023 

448364 T08958 

407239 AA076350 
401847 

429523 AK000788 



Hs.159732 

Hs.157201 

Hs.67735

Hs.255453 

Hs.5476 

Hs.279895 

Hs.173108 

Hs.144464 



TRIAD3 protein 
ESTs 

ESTs, Weakly similar to ACR3_HUMAN 30 KD ADIP 
G protein-coupled receptor 56

Homo sapiens cDNA FLJ14303 fis, clone PLACE2000



acetyl LDL receptor; SREC
ESTs

serine protease inhibitor, Kazal type, 5

Homo sapiens mRNA for KIAA1578 protein, partial cd

Homo sapiens cDNA: FLJ21897 fis, clone HEP03447,



gb:DKFZp434A1229 r1 434 (synonym
KIAA0750 gene product
proteasome (prosome, macropain) 26



Hs.105423 
Hs.125078 
Hs.122596



Hs.178113 
Hs.121619 
Hs.83954 
Hs.156628 



ornithine decarboxylase antizyme 1

ESTs

gb:yx22a11.r1 Soares melanocyte 2NbHM Homo sapie

ESTs
ESTs

gb:yx51g04.r1 Soares melanocyte 2NbHM Homo sapie
ESTs, Moderately similar to kinesin like protein 9 [M.m
chromosome 11 open reading frame 15
protein associated with PRK1
ESTs

Homo sapiens cDNA: FLJ21763 fis, clone COLF6967
gb:RC0-HT0613-14030O-021-d06 HT0613 Homo sapie
gb:601112470F1 NIH_MGC_16 Homo sapiens am
gb:RC0-CT0379-2901004)32-b10 CT0379 Homo sapie
ESTs

gb:ox49d06.x1 Soares_total_fetus_Nb2HF8_9w Homo
ESTs

solute carrier family 2 (facilitated glucose transporter),
Homo sapiens cDNA: FLJ22625 fis, clone HSI06009
gb:RC1-[]T0001-031299411-a11 DT0001 Homo sapie



Hsi70482 
Hs.95958 

Hs.106534 



Hs.69469 dendritic cell protein 



Hs.120360
Hs.11282 
Hs.246783 

Hs.25691 receptor (calcitonin) activity modifying protein 3

predicted exon

Hs.21299 ESTs, Weakly similar to AF151840 1 CGI-82 protein [H

Hs.170298 ESTs

Hs.123164 ESTs, Weakly similar to match to ESTs AA667999 [H.
Hs.164307 ESTs

gb:ql63c10.x1 Soares_NhHMPu_S1 Homo sapiens cDN
Hs.110675 apolipoprotein C-IV



Hs.23467 
Hs.111680
Hs.202821 
Hs.75730 
Hs.57079 
Hs.190232 



gb:MR4-ST0125421199-017-e07ST012S Homo sapie 
hypothetical protein FLJ10633
endosulfine alpha



gb:oc08e123l NCI_CGAP_GCB1 Homo sapiens cDN
predicted exon
Hs.205555 ESTs
Hs.58314 ESTs

Hs.266175 phosphoprotein associated with GEMs
Hs.233955 hypothetical protein FLJ20401

Hs.22564 myosin VI
Hs.61490
Hs.102497



Hs.90419 KIAA0882 protein

Hs.143316 ESTs

Hs.250497 ESTs, Highly similar to dJ745C22.1 [H.sapiens]

Hs.16561 HSPC141 protein

Hs.67646 leukocyte immunoglobulin-like receptor, subfamily B (

Hs.205280 Homo sapiens cDNA FLJ20781 fis, clone COL04235




wo 02/102235 



PCT/US02/19297 



432345 AI989751 

400246 

404971 



448199 AI953278 
443422 R10288 
401117 
400613 

431214 AA294921 



Hs.77837 
Hs.135657
Hs.200302
Hs.100000
Hs.10248
Hs.194657
Hs.170557
Hs.301529 



predicted exon 
predicted exon 
ESTs, Weakly similar to: 
UDP-glucose pyrophosphorylase 2



S100 calcium-binding protein A8 (calgranulin A)
Homo sapiens cDNA FLJ20157 fis, clone COL09512
cadherin 1, type 1, E-cadherin (epithelial)



predicted exon
predicted exon

v-ral simian leukemia viral oncogene homolog B (ras re
Homo sapiens cDNA: FLJ22615 fis, clone HSI05118
42isjb KWU Ms.iuaaua ARS component B
427154 AL137262 Hs.288991 Homo sapiens cDNA: FLJ22523 fis, clone HRC12507

436678 BE512828 Hs.5273 NADH dehydrogenase (ubiquinone) Fe-S protein 3 (30k



430478 NM_014349 

437623 D6388C 
401244 

415167 AA160784 

438291 BE514605 
406183 

436480 AJ271643 

466691 AI023428 

418332 R34976 

446CS2 AA3S8760 

444859 AW449137 

437192 AW975786 
400891 

448372 AW445166 

425798 AA364002 

469263 AL167476 

420746 AW195932 

414717 BE271039 
400727 

422691 NM_003365 
405639 

414444 BE298594 

456146 AL034349 

41461C BE338044 

414267 AL078459 
401268 
403613 

414203 BE2S2170 

454315 AW373564 

452114 N22687 
404638 



TNF-inducible protein CG12-1 
chromosome condensation-related SMC-associated pro
predicted exon

NA: FLJ22380 fis, clone HRC07453.



Hs.26410 ESTs 



Hs.87459 
Hs.205696 
Hs.78293 



Hs.205696 ESTs 



gb:EST67699 Fetal lung
ESTs

ubiquitin-conjugating en



Hs.170802 ESTs 



Hs.32913 
Hs.197488 
Hs.77060 



449966 H60542 
402585 

436008 AI078428 
401492

412288 NM_003005
405088 

437345 BE259522 
432280 BE440142 



439938 AI147392 

418844 M62982 

446081 AA972412

443534 AI076123 

459510 AA076705 

451938 AI354355 

454478 AW805749 

407214 AA41204B 



450657 AK001579 



proteasome (prosome, macropain) subunit, beta type, 6

ubiquinol-cytochrome c reductase core protein I

predicted exon

gb:601119754F1 NIH_MGCJ7 Homo sapiens cDNA
protein tyrosine phosphatase, receptor type, K
gb:601283747F1 NIH_MGC_44 Homo sapiens cDNA



predicted exon 



sel 



Hs.22316 Homo sapiens clone 24819 mRNA sequence
Hs.181165 eukaryotic translation elongation factor 1 alpha 1
Hs.21497 Homo sapiens mRNA for FLJ00042 protein, partial cds
Hs.37848 ESTs

predicted exon
Hs.58785 ESTs

predicted exon

Hs.73800 selectin P (granule membrane protein 140kD, antigen C



Hs.5556 
Hs.2943 
Hs.91448
Hs.264881 
Hs.252351 
Hs.6019 
Hs.124607 



Hs.59236 ESTs, I 



NADH dehydrogenase (ubiquinc 
signal recognition particle ISkD 
MKP-1 like protein tyrosine phosphatase 



ESTs 



f-box and WD40 domain protein 2 
gb:oy92e04.x1 Soares_fetal_liver_spleen_1NFLS_S1 H
gb:7B01B02 Chromosome 7 Fetal Brain cDNA Library
ESTs, Weakly similar to B35049 ankyrin 1, erythrocyte
down-regulator of transcription 1, TBP-binding (negativ
gb:QV1-UM0105-180400-162-f10 UM0105 Homo sap
CGI-39 protein; cell death-regulatory protein GRIM19
predicted exon 
cytochrome c-1 



428797 AM96205

416272 AA178882 

431267 AF039697 

447775 BE179318 

444140 AV648089 

446102 AW168067 

416475 T70298 

430783 AW971248 

414070 AW963783 

444283 AI138971 



409417 AA156247 

435380 AA679001 

406752 AI285S98 

406096 F12200 

417551 AI816291 

441252 AW360901 

419608 AL037237 

438894 AI630819 

451287 AK002158 

412499 AW956916 

433355 AI808235 

416818 Aig86408 

438765 AI031888 

424470 BE244261 

416194 H27114 

446702 R44518 

414222 AL135173 

443122 AI806656 

448648 BE614345 

456394 W28506 



Hs.58033 ESTs 

predicted exon 

Hs.193700 Homo sapiens mRNA; cDNA DKFZp586I0324 (from c
gb:zp38b09.r1 Stratagene muscle 937209 Homo sapiens

Hs.143843 ESTs



gb:RC1-KTC615-2903C(W)21-g05 HT0615 Homo sapie

predicted exon
Hs.282383 ESTs
Hs.252956 ESTs

gb:yd26g02.s1 Soares fetal liver spleen 1NFLS Homo s
Hs.291289 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFA

gb:EST375856 MAGE resequences, MAGH Homo sap
Hs.154636 ESTs

Hs.3057 zinc finger protein 74 (Cos52)

gb:RC2-ST01 68^)71299-013^ 8T0168 Homo sapien
ESTs, Weakly similar to ALU7_HUMAN ALU SUBFA



HS.2959C8 
Hs.192221
HsJ17493
Hs.5811
Hs.82273
Hs.183047
Hs.91686 
Hs.300431 
Hs.26194 
Hs.11238 



chromosome 21 open reading frame 59 
hypothetical protein 
ed protein product [H5 



hypothetical protein FLJ11296
KIAA0622 protein; Drosophila 'multiple asters' (Mast
gb:wf44e01.x1 Soares_NFL_T_GBC_S1 Homo sapien
ESTs, Weakly similar to B48013 proline-rich proteogly



ESTs, Weakly similar to Pro-Pol-dUTPase polyprotein






403153 

444907 AW772696 

421946 R99629 

437513 AW410681 

407752 AA573581 

447953 AI804218 

425708 AK001342 

421449 AA713491 

418323 NM_002118 

447787 BE620108 

422716 AI702835 

443958 BE241880 

417908 AA207221 

438542 AA810131 

400288 X06255 

456826 H67220 

431360 NM_000427 

414266 BE267834 

440571 AA904461 

426075 AW513691 

413488 BE144017 

446767 AI380107 

418008 W56044 
404239 

458401 AW236939 

412955 BE241849 

423072 AI792946

444954 AW247076 

449023 AI623261 



404345 AA730407 

418512 AW498974 

411551 AW851309 

446726 AW300144 

410748 BE383816 

449618 AI076459

429697 AW296451 

424012 AW368377 



Hs.209614
Hs.291501



integrin, alpha 5 (fibronectin receptor, alpha polypeptid




ras homolog gene family, member G (rho G)

solute carrier family 12 (sodium/potassium/chloride tran

eukaryotic translation initiation factor 2, subunit 2 (beta



myosin, heavy polypeptide 9, non-muscle
gb:yj38c09.r1 Soares placenta Nb2HP Homo sapiens cD
hypothetical protein FLJ2a079
ESTs

Homo sapiens mRNA for inositol 1,4,5-trisphosphate 3
gb:QV3-BT0296-010300-111-e04 BT0296 Homo sapie

protocadherin 11

diacylglycerol kinase, zeta (104kD)
gb:IL3-CT0220-170200-067-C11 CT0220 Homo sapien
Homo sapiens cDNA FLJ11629 fis, clone HEMBA100
ESTs, Highly similar to bG115G20.2 [H.sapiens]
Homo sapiens cDNA FLJ12819 fis, clone NT2RP2002
ESTs

tumor protein 63 kDa with strong homology to p53




403151

452363 AI582743 

425971 AF135024 

432826 X75363 

431972 AI805145
400269 

404703 AI904493 

449335 AW150717 

413443 NM_006239 

445773 H73458 

433732 AF090945
405473 

420331 AA280824 

402939 

405196 

452947 AW130413 

414170 AA335996 

437133 AB018319 

458356 AI024855 

407857 AI928445
405687 

415189 L34657 



430224 AW675175 

458792 N56665 
402944 

422675 BE018517 

408661 AW247625 

423238 AA323S69 

421517 ABC183S2 

429865 AB023217 

440815 AW071945 



407464 AJ276396 



4276C9 M62606 

418846 AI821602 

448891 AI537332 

44693C AF05600g 

421254 AK001724 

447073 AW204821 

446438 AB014578 

432126 AA865239 

424091 AF236C97 

440832 AI057548

449228 AJ403107 



Hs.94963 ESTs, Highly similar to C1QC_HUMAN COMPLEME
Hs.165296 kallikrelnIS 
Hs.250770 kallikreinIS 
Hs.191711 ESTs 

predicted exon 

Hs.99890 polymerase (DNA directed), delta 1, catalytic subunit (1
Hs.296176 STAT induced STAT inhibitor S
Hs.85146 v-ets avian erythroblastosis virus E26 oncogene homolo
Hs.13299 Homo sapiens mRNA; cDNA DKFZp761M0111 (from

gb:Homo sapiens clone HQ0670

predicted exon 
Hs.190035 ESTs 



gb:xf50f04.x1 NCI_CGAP_Gas4 Homo sapiens cDNA
matrix metalloproteinase 24 (membrane-inserted)
KIAA0776 protein
ESTs

hypothetical protein FLJ20163
predicted exon
platelet/endothelial cell



08.x1 Soares_NhHMPu_S1 Homo sa



Hs.280482 
Hs.105399 
Hs.225g68 
Hs.7436 

Hs.25846 
Hs.279918 
Hs.288013 



zinc metalloproteinase, STE24 (yeast, ho



similar to yeast BET3 (S. cerevisiae)

gb:Homo sapiens mRNA for matrix extracellular phosp

gb:Homo sapiens full length insert cDNA clone ZD47B



Hs.189729 ESTs



Hs.129663 

Hs.2161 
Hs.115127
Hs.209115
Hs.13456
Hs.102950

Hs.12707



Hs.139263 calcium



gb:ns92h09.s1 NCI_CGAP_Pr3 Homo sapiens cDNA c
predicted



ESTs, Weakly similar to AF208846 1 BM-CC4 [H.sapie



400837 

458866 BE616694 

417124 BE122762 

414376 BE393856 

418636 AW749855 

454128 AL031269 

441074 AW500001 



431838 AI097229 
402866 

449635 AI989942

434392 AW983709 

414973 C19089

428374 AW4C5156 

415745 AI301107 

432532 AW058459 

417112 AA193439 

418101 AL047476 

453110 AW384928 



Hs.288042 
Hs.25338 
Hs.66915 



Hs.217484 ESTs 



Homo sapiens cDNA FLJ14299 fis, clone PLACE1010
ESTs

ESTs, Weakly similar to 16.7Kd protein [H.sapiens]
gb:QV4-Br0534-281299^)53HK)5 BT0534 Homo sapie
programmed cell death 2

Homo sapiens cDNA: FLJ22035 fis, clone HEP08838

ankyrin 2, neuronal 
predicted exon 



predicted exon 



Hs.183994
Hs.150790
Hs.162246



hypothetical protein FLJ20129
gb:C19089 Human placenta cDNA (TFujiwara) Homo
, catalytic subunit, alpha isoform



436989 AA741028 Hs.256155 ESTs 



gb:zr41b09.s1 Soares_NhHMPu_S1 Homo sapiens cDN
gap junction protein, beta 4 (connexin 30.3)
Homo sapiens cDNA FLJ13102 fis, clone NT2RP3002
gb:AJ239397 Uni-ZAP XR retinal pigment epithelium H












407396 AF011767 

449684 AI659166 

454665 AW812994 

430492 U15197 

439460 AA836220 

449231 BE410360 

453060 AW294092 

416961 BE391476 



424585 AA46484C 

431029 BE392726 

441680 AW444598 

437830 AB020658 

409479 BE163800 

409885 AWS0306B 

459090 AA443323 

429324 AA488101 
403766 

413970 U59309 

456674 BE266120 

417931 W95642 

430125 U46418 

452154 AW953265 

422984 W28614 



446360 AF062112 

450214 BE439763 

449733 R74546 

411660 AW855718 

442663 BE269247 

447652 AI394125 

448712 W01048 

420180 AI004035 

440099 AL080068 

427550 BE242818 

432894 AW167668 

412113 AW161274 

431614 AI189827 

445870 AW410053 

424347 AA723883 

425132 AW250114 

439766 AL369661 

432946 U60399 
406130 

453359 AA448787 
405491 

436481 AA379597 

446826 AK000526 
441211 



Hs.207144 ESTs 



gb:Homo sapiens RAGE binding protein (P12) mRNA,



418711 
457301 



439090 H65724 

416586 044643 

411940 AW876686 

407639 AW205369 

458012 AI4248g9 

426490 NM_001621

408741 M73720 

437371 AK000868 

437134 AA349944 

441890 AI809547 

409442 AA310162 

407078 226266 

436653 AW407157 



Hs.21594 
Hs.80617 
Hs.255976 



gb:601302340F1 NIH_MGC_21 Homo sapiens cDNA 

ESTs 

ribosomal protein S16



gb:zx43h11.r1 Soares_total_fetus_Nb2HF8_9w Homo
Homo sapiens PAC clone RP5-1163J12 from 7q21.2^3
RAP1, GTP-GDP dissociation stimulator 1
KIAA0851 protein



Hs.76663 fumarate hydratase 

Hs.269358 ESTs 

Hs.82961 trefoil factor 3 (intestinal) 

Hs.233950 serine protease inhibitor, Kunitz type 1

Hs.271277 hypothetical protein from EUROIMAGE 363668



protein tyrosine phosphatase, receptor type, A
CD53 antigen 
predicted exon 



Hs.12540 lysophospholipase I

Hs.227571 regulator of G-protein signalling 4

Hs.29438 Homo sapiens cDNA FLJ12094 fis, clone HEMBB100
gb:RC1-CT0279-070100^21-a06 CT0279 Homo sapie

Hs.170226 Homo sapiens clone 23579 mRNA sequence

Hs.160413 ESTs

Hs.181634 Homo sapiens cDNA: FLJ23602 fis, clone LNG15735



Hs.6909 

Hs.179606 nuclear RNA helicase, DECD variant of DEAD box fam
HSJ79772 brain specific protein
Hs.74427 p53-induced protein

gb:qd19d07.x1 Soares_placenta_8to9weeks_2NbHP8to
Hs.13406 syntaxlnlS

Hs.145513 Homo sapiens mRNA; cDNA DKFZp434lj0435 (from

gb:2821134.5prime NIH_MGC_7 Homo sapiens cDNA
Hs.283852 Homo sapiens mRNA full length insert cDNA clone EU
Hs.279854 mannosidase, alpha, class 2B, member 1
predicted exon

Hs.24872 ESTs, Weakly similar to aortic carboxypeptidase-like p
predicted exon

Hs.5133 HSPC150 protein similar to ubiquitin-conjugating enzy
Hs.16230 hypothetical protein FLJ20619
Hs.7750 hypothetical protein AL133206
Hs.875a5 translocase of inner mitochondrial membrane 22 (yeast)
gb:nc67e03.s1 NCI_CGAP_Pr1 Homo sapiens cDNA c
ESTs, Highly similar to ALU4_HUMAN ALU SUBFA



Hs.271663 ESTs
Hs.14144 



Hs.252936 

Hs.188211 

Hs.170087 

Hs.646 

HsJ570 

HS.4291S 

Hs.128075 

Hs.169248 

Hs.181125 



aryl hydrocarbon receptor 
carboxypeptidase A3 (mast cell)
hypothetical protein FLJ10006
ARP2(a( 



448771 BE315511 Hs.296244 SNARE protein



423623 AB011117 

422661 NM_016670

403221 AL134378

431620 AA126109

404794 NM_000078

412944 AA384110 

450817 N71597 

418666 AF001434 

451836 AW173270 

426302 AA45g085 

454485 AW795322 



KIAA0545 protein 
DKFZP686K0919 protein
karyopherin alpha 4 (importin alpha 3)
2'-5'oligoadenylate synthetase 2



Hs.197143 ESTs 
Hs.29693 ESTs 
Hs.155119 EH domain containing 1 



431006 BE152871

'M3476 AW068594 

438828 AL134275

407634 AW015559 

436857 AA732647 

431525 Y1C129 

447386 NM_006289 

436573 AA723297 

43285B BE61860g 

437352 AL353957 

413209 AW0B3791 

4Q7376 AA993138 

430475 BE387420 

446764 AW291276 

425868 AB017548

453464 AI884911 

447246 AW449032 
4C1780 

434063 AA018893 

416114 AI695549

441018 AI809587 

426972 BE391663 

426C62 N57014 

451234 AI914901 

429665 AB020719 

418092 R45164 

424650 AI650541 

426023 AW956889 

446213 AW204314 

418102 R68968 

450082 AI908B94 

446749 NM_016059
406124 

457408 AL137607 

410051 U25773 

440965 AIS23646 

440190 AW752597 

417437 U52682 

454249 AW249008 

432276 AF163302 
401116 

423960 AA164516 

451661 ABa206S0 

450983 AA305384 

446187 AK001241 
404122 

411299 BE409857 
403077 

438000 AI825880 

447118 AB014599 
417878 U90916 
444079 H09048 
458234 BE551408 

434208 T92641 

423136 AW375506 
403177 

448699 AI8S7269 

425248 AW957442 



Hs.258742 
Hs.18420 
HS.12713B 
HS.279S91 
Hs.284181 
Hs.21263 
Hs.142287 
Hs.241531 



hypothetical protein FLJ20502 
predicted exon 

gb:CM1-HT0333-101299^64^i12 HT0333 Homo sapi

ESTs, Weakly similar to AF151889 1 CGI-131 protein

hypothetical protein DKFZp761F2014

ESTs, Highly similar to AF241831 1 intracellular hyalu

gb:nz89d01.s1 NCI_CGAP_GCB1 Homo sapiens cDN

myosin-binding protein C, cardiac

KIAA1027 protein



hypothetical protein DKFZp434P0531

Homo sapiens cDNA FLJ13152 fis, clone NT2RP3003

ESTs, Weakly similar to ALUF_HUMAN !!!! ALU CL



Hs.3727 unr-interacting protein



ESTs 



Hs,44( 
Hs.24052 
Hs.207802 KIAA0912 protein
Hs.106604 ESTs
Hs.115298 ESTs

Hs.154210 endothelial differentiation, sphingolipid G-protein-coup
Hs.170784 ESTs
Hs.26608 ESTs
Hs.245893 ESTs
Hs.16089 CGI-136 protein
predicted exon

Hs.255348 Homo sapiens mRNA; cDNA DKFZp761P211 (from c
Hs.218182 ESTs, Weakly similar to dJ1042K10.2 [H.sapiens]
Hs.169859 ESTs

gb:IL3-CT0214-161299^)46-B05 CT0214 Homo sapien
Hs.82132 interferon regulatory factor 4

gb:2821048.5prime NIH_MGC_7 Homo sapiens cDNA
HS.27425S somatostatin receptor-interacting protein

predicted exon
Hs.136309 CGI-61 protein
Hs.26777 KIAA0843 protein
Hs.25740 ERO1 (S. cerevisiae)-like
Hs.14229 hypothetical protein FLJ10379

predicted exon
Hs.69499 hypothetical protein

Hs.5986 non-kinase Cdc42 effector protein SPEC2

Hs.17411 KIAA0699 protein

Hs.82846 Human clone 23815 mRNA sequence

Hs.23606 ESTs

Hs.127196 ESTs

Hs.127648 hypothetical protein PRO2176

Hs.124147 ESTs 

predicted exon 

Hs.227351 ESTs 

Hs.252766 ESTs 

HS.1SS33S ESTs 



TABLE 3B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



408310 
408647 
408661 



410626 
411004 
411014 
411028 



1051011 1 
1071855_1 
1073036_1 



1212621 -1 
1228S75 1 
1229091J 
1229404J 



Accession 

AW179023 AW179010
AW245831 AW273207
AW247625 AW249214

H85615 H86300 H86263 H86282AA059278 H86304 
AW389668 AW389667 AW609198 AW389649 



AW501137 AW501296 AW501212 
AW502208 AW502366 AW502148 
AW503068AW503789 

AA079487AA128547AA128291 M079587 M079e 



BE407727 

AW813242 BE146089 AW81319S AW813173 AW813206 BE1459S3 BE146212 AW813196 AWB54582 AW813241 BE061582
AW816072 AW813375 AW813385 AW813372 AW813436 AW816148 AW813475 AW8161C7 AW813398 AW813479 AW814475 AW813317
AW813703 AW813839



411541 
411551 
411651 
411660 
411696 



411942 
412793 
413047 
413101 
413154 






1236374J 
1245222_-1 
1249044 1 
1249196J 
1252835 1 
1263078 1 
1254304J 



AW833752 AW833633 AW833776 AW833719 AW833362 AW833749 



1346806J 
1349154.1 
1351 077J 
1358147.1 
1370508J 
1375671.1 
1379792 1 



413679 1382784.1 



413758 
414070 
414196 
414203 
414266 
414276 
414333 
414444 
414539 
414540 
414605 
414610 
414626 
414642 
414663 
414973 



417611 
417908 
418635 
418874 

419889 
420902 
422150 
422731 



141442 1 
1424864 -3 
1426610_2 
14309B4 1 



1472628_-1 
151 0755 J 
1525766 J 
1540470 J 
1661575_1 
158407_1 



170764 1 
177402 1 
1799516.1 



197525J 
212412.1 
220507 1 
221879.1 



230314.1 
241151.1 
245794.1 



266686.1 
260276.1 
265381.1 



428151 
431006 
431257 
431614 
431822 
432966 



433782 
434098 
435138 
435478 



AW855718 AW856740 AW866748 
AW857404AW857401 BE144856 
AW854370 AW864319 AW864504 
FD6485AW876454 

AW876686 AW876717 AW877215 AW876691 AW876722 AW877218 AW876694 AW876725 
AW877015AW877133AW876978AW877071AW8769a8AW877069AW877063AW877013 



BE1 40643 BE140545 BE140644BE140657 BE140660 BE140659 BE1406ei 
BE14722S BE14720SBE147234 

BE152644 BE152712BE1S2668BE1S26S9 BE152810 BE152811 BE1S2816 BE1S2S43 BE1S2706 BE1S26S6 BE1526S0 BE152715 BE1S2662 
BE152669 BE152661 BE162672BE152653 BE152716 BE152651 BE152767 BE152677 BE152652 BE152714 BE152708 BE152665 BE152679 
BE152771 BE152775BE152666 BE152768 BE152813 BE152664 BE152676 BE152681 BE162709 BE152667 BE152B14 BE1528C8 BE152711 
BE152707BE152815BE152678BE1 ' ' " ' " " " ' 



BE156765 BE1S6770 BE156767 BE156769 BE156B03 BE1S6802 BE1S6847 BE15SBS3 BE1S6780 BE1S6836 BE156792 BE1S6834 BE1 56779 
BE166789 BE156833 BE156844 BE156831 BE156849 BE1S6797 BE1S6784 BE156801 BE156843 BE156793 BE156852 

BE162391 

AW963783 F36521 F30667 AW753177 AW753195 AW853065 AA135150 AA375C28 
BE262170 BE3a2553 BE261026 BE273627 



BE379046BE396459 

BE3790S0 

BE390440 

BE388044BE391117BE39153C 
BE410589 BE390949 BE408297 BE3B9S29 
AA150350AA361174AWg59038 

BE396326 

C19089C18814 C16621 
T82802 D78670R08506 

W70022 R35201 F1 2763 T74725 H63485 Z45782 H61 126 
Z43912 H09194 

AA1 78882 AA1 79898 AA1 78897 
T70298 H58072 R02750 

AW934714 BE161007BE162500AW749902AW749864BE162498BE161005AA190449AW513465 

AA193439AA193637AW814128 

AW993983 AW994798 AW993990 AW993999 AW993989 AA204755 
AA207221 BE538271 

AW749866 AA225996 AW750208 AW750206 
T60872 T60906 

AA528295 AW971284 AA247945 

AA251500AA279607 

AA742277 AW976493 AA281585 

AW682898AA305114 

AL133411 AL138412AA315860 

R02504 AA317715 AW961465 AF121172 

AA320946 H921 14 BE14444g BE144438 



H38340 H39081AA324112 
BE002904 H64880AA32BS79 
AA464840AA343628 



37414J 
330006.1 
401159.1 
406683.1 



AW2501 14 Z43124 AA431421 AI8790S4 AA3S1616 AA3S103S AL048999 

BEC042S7AW811190AA36CS76BE172402BE181703 

AA364002AI522307 

N32049R34821 R78237 

BE536836AA376153 

BE637380 BE265216 

AA422028 W79191 

BE162871 BE152870AA490552 

AF039597 BE243938 

AI189827 AW360554 AW860552 AA508543 

AA616049AW004922 

AA6601 14 AW974148 AA572946 

AA582307 BE273018 

AI808236AI024296AA584528 

AA693498 AW749647 AW749630 

F12741 T75166AA594014 

AF090945 AW996754 AI064870 

AA625499 AA625259 AA625184 

BE314734AA666393 

AA682622.BE141B96 



440190



428Q68 1 
46841J 
47272_1 
477806J 
488021 _1 
499861J 
441552 620138 1 
442257 63699J 
443198 562655 1 
443534 672957 1 
446052 65988J 
446598 68463J 
447224 71279_-1 
447252 714160 1 
447333 71990 1 
447775 73665_1 
447737 73719 1 



454163 
454178 
454181 



AA732647 BE00897C BE009028 
AF085936 H64070 H64017 
AF086285 W69S87W69421 
AW979249 D63277AA8469B8 

AW752597 AW848781 AW849062 AW848490 AW752699 AW752604 AW7S2700 



AA937975F11215BE 
AW5C3831 AW503317BE565665 
AIC39813 A1684642 Z40121 AI951414 BE501049 
AIC76123A1244834A1695239 
AA353760 AA158850 AW052737 AW062738 AV656291 
AW26C646 BE257108 BE251005 BE256967 BE250926 BE513012 AV659318 
BE617125 

R90916 AL120023 R18429 Z42095 AI369730 R90824 
N24231 BE617964 N36313 
BE179318 BE620044 



AI188489 BE622201 

AI492857AWC70478AI885157 

BE614761 AA263136WC0335 W00327 

BE41 0360 AA442408AA31 5540 

N31036 N42915 F07753 AA010329 

BE160479 BE160478 BE069211 AW861069 AI793147 

AW851888 AW851889 AW852147 



1051071.1 
1073933.1 
114761.1 
1214744 1 
1215381_1 
1219564 1 
1225636 1 
1226543.1 
1227504 1 



454303 
454961 
455132 



AW175997 AW176C00 AW175999 AW175994 AW176004 AW175989 
AW177274 AW177249 AW177223 AW177216 AW177233 

AW177377 AW177357 AW177359 AW177385 AW177358 AW177396 AW177394 AW177396 AW177383 AW177333 AW177384 AW177382 
AW177360 AW177356

AW179083 AW179085 AW179087 AW179081 AW179084 AW179086 AW179082 AW801493 AW80165B AW801714 

AW249008BE295653BE2g6765 

AA076B11AWB14764 

AW805749 AW805872 AW794466 AW798102 AW796921 AW794538 AW794380 
AW795322AW7953C8AW795311 AW795310AW795314 AW795321 
AW801365 AW801435 AW801372 

AW80910gAW80gi12AW809122AW809126AW809128AW809133 AW80gi31 AW80gi13 AW809111 AW809132 . 
AW810224 AW810337 AW81029S AW810333 AW810335 AW810296 AW8160S3 
AW81 1380 AW81 1385 
AW812gg4 AW812723 AW812930 

AW860148 AW862380 AW821887 AW821863 AW321870 AW821894 AW862351 AW862378 
AW847807 AW847935 AW847636 

AW867966 AW861636 AW857967 AW857958 AW857943 AW857946 AW857963 AW857968 AW857959 AW857961 AW857956 BEC72135 
AW857972 BE0721 37 AW857952 AW857935 AW857940 AW857944 AW857947 AW857934 
AW937792 BE072250 BE072251 BE072264 



455694 1350650.1 BE067300 BE067293 BE067279 

455910 1382504.1 Z43712 BE156729 BE1S6538 BE156731 BE1S6673 BE1S6S39 BE15SS74 BE1S6430 BE1S6672 BE1S6676 BE1S6432 BE1S6S41 

455993 1398665_1 BE179085 BE179084 BE179086 BE179264

456054 1452761.1 BE313241 BE383148 

456329 1789807.1 T41418 T41320 T41379

456394 1843275.-2 W28506 

457301 314434.1 

458025 46409.1 

458606 65568.1 

458640 670076.1 AI284935 AW409822 BE408182

458710 69727.1 AV660856 BE167375

458792 748294 1 N56666 AI460076

459170 920646.1 AI905518 AI905516 AI905457 AI905515 AW176013 AW176037

459270 969232.1 AL039604 AL039497

TABLE 3C:

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495

Strand: Indicates DNA strand from which exons were predicted

Nt_position: Indicates nucleotide positions of predicted exons
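For readers who wish to process the Nt_position entries of Table 3C programmatically, the following is a minimal illustrative sketch, not part of the disclosed methods. It assumes each entry is a comma-separated list of start-end pairs (as in "101102-101223,101886-102018") and that both coordinates are inclusive; the inclusive convention and the function names parse_nt_position and total_exon_length are assumptions introduced here for illustration.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a Table 3C style Nt_position string into (start, end) pairs."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a trailing comma
        start_text, sep, end_text = part.partition("-")
        # Reject entries that are not of the form <digits>-<digits>.
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            raise ValueError(f"unreadable range: {part!r}")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"end before start: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("101102-101223,101886-102018")
    print(exons)
    print(total_exon_length(exons))
```

Entries whose digits or separators cannot be read raise ValueError rather than being silently repaired, so damaged rows can be checked against the original table by hand.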



Pkey 



8118474 
8118768 
8118768 



50389-51188 
92278-92472 

101102-101223,101886-102018 

10475-10845
84912-86187 
58189-68323 

72969-73050,73713-73800 
106175-107016 



400842 
400848 
400891 
400917 



1927148 
1927148 

7283186 



400931 7661921

400964 7139719 

400965 7770575 
400970 7860452 
400982 8078794 
401010 8117391 
401072 3687273 
401088 8492704 

401116 9966559 

401117 8570083 
401167 9438381 
401204 9743388 
401220 9929324 

401244 4827300 

401245 4827300 
401268 9797154 



142145-142353,144311-144721 



173043-173564 

92744-92895 

119245-119471 

83967-84180 

64370-64524 

194659-195179 



18944-19176 
33694-33872 
48079-48279 
55359-56375 



152272-152433,157312-157418,158025-158206,158838-158974,160716-160952



401373 7248205 

401405 7768126 

401465 6682292 

401492 7341778 

401521 7705251 

401566 8469090

401575 7229804 

401589 9966292 

401628 8575954 

401657 9100664 

401747 9789672 

401757 7239630 

401780 7249190 

401781 7249190 
401785 7249190 
401789 7249213 
401809 7342191 
401847 7139731 
401837 7229981 
401913 9369520 
401962 3176728 
401991 4156128 
401994 4153858 
402023 7528158 
402065 6549269 
402071 8117361 
402075 8117407
402131 7704961 
402144 7242326 
402203 8576119 
402277 2894631 
402292 2447220 



210617-210796 
7312-8163 

118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866- 

13193^132451-132575,133580-134011 

88641-88751 

28397-28617,2892&-29046,29135-29296,29411-29567,29705-29787,3022«0573 
83215-83435,83531-83656.8374^83901,84237-84393,84955-85037.86290-86814 
165775-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942

70399-70629,70941-71055 

107543-108298 

85447-85593 

93973-94120 

33753-33904 

71433-71648,76711-76833,78677-78845,79585-79763,82349-82485 

2398-2513 

42904-43124,43211-43336,44607-44763,45199-45281,46337-46732

132872-133040 
135543-136031 



121907-122035,122804-122921,124019-124161,124455-124610,125672-126076

33114-33209,33496-33678 

115425-115977 

8124-8286 

16980-17152,17933-18018,18170-18306
33880-34029,34176-34336,34953-35103



402570 9884747 
402585 9908890 
402639 9958129 



171761-171996 

96314-96539 
89684-90893 



174893-175050,163210-183435 

2C167-22383 

2218-2440 

182773-182883,184551-184732 



402869 6434643 

402939 9187334 

402944 9368423 

402948 9368458 

402958 9368493 

403010 3132346 

403036 3132360 

403051 4827080 

403065 8954197 

403077 8954241 

403093 8954241 

403151 7407965 

403153 9799871 

403177 9838213 

403223 7630969 

403234 7637801 

403273 8018055 

403286 8080320 



110411-110716,111173-111640 
143456-143626,143808-143935 
13324-13507 



66545-66712 
5269-5411 

71615-71773,73930-74144
146923-147222,147326-147628 
177083-177373,177464-177751 
14055-14264 



81529-81692 
180641-180822 
133809-134099 
118369-118872 
403287


8080320 




126097-126411 


403348 


7239627 


Plus 


13809-13968 


403359 


8570207 




108939-109229 


403362 




Plus 


64099-64260 


403447 


9837821 


Minus


159072-159387 


403508 


7630896 


Plus 


5570-5719 


403582 


8101186 


Plus 


18308-18458 


403613 


8493504 


Plus 


81290-81465 


403642 


8699671 


Plus 


7062-7311 


403662 


5823349 




58627-59062,59222-59548 


403674 


7321642 


Plus 


104988-105623,107394-107590 


403687 


7387384 


Plus 


9009-9534 


403695 


3046276 


Plus 




403703 


4966380 




83681-84042 


403741 


7630932 






403747 


7658395 


Minus 


20493-20621 


403766 


7229888 


Plus 


136283-136830 


403786 


8083636 


Minus 


73028-73217 


403796 


8099896 


Minus 


75073-77664 


403833 




Plus 


13522-13664 


403852 


7708872 






403861 


7708966 


Plus 


58363-58649 


403912 


7710730 


Minus 


72000-72290,72431-72700,72929-73199 




7711688 




89369-89592 


403964 


7596976 


Plus 


178174-178300 


404034 


8567760 




404067 


3282162 


Plus 


1415-2071 


404097 


7770701 


Plus 


55512-55781 


404122 


9796270 


Plus 


90540-92977 


404230 


7981448 


Minus 


92934-93093 


404239 


5002624 


Plus 


94841-95095 


404240 


5002624 


Minus 


116132-116407,116653-116922 


404270 


9828129 


Minus 


364g-37S0,4161-4306,S96»049,6B49{g65 


404356 
404600 


7630858 
8705107 


Minus 
Plus 


126433-126623 

118354-118444,118649-118792 


404601 


8705107 


Plus 


128449-128693,129085-129249,130525-130733 


404638 


9796751 




99433-99528,100035-100161 


404666 


7272179 




18677-18993 


404676 


9797204 




48532-43e45,49808-4g975,S108a«136g,S4944-5l 


404727 


8031050 




115534-115747 


404750 


7596836 


Plus 


181879-182198 


404763 


7882612 


Plus 


50981-51392 


404767 


7882327 






404828 


6530415 






404850 


5420148 




35145-35413,40635-41062


404881 


5931510 






404890 


7329390 




101280-101408


404971 


3212939 




74585-75532 


405022 


7330304 


Plus 


217163-217439 


405028 


7533974 




110588-110847,110933-111115 


405071 


7703797 


Minus 


11115-11552 


405088 


8072518 




115690-117621 


406133 


8516055 




28127-28288 


405138 


8576241 


Plus 


90303-90516 


405183 


7209940 


Plus 


12335-12653 


405194 


7230072 


Plus 


190465-190645,193346-193610 


405196 


7230083 


Minus 


135716-135851 


405208 


7230142 


Plus 


8068-8214 


405226 


7248966 


Plus 


53547-54128 


405227 


6731245 


Minus 


22550-22802 


405256 


7329310 


Plus 


26070-26309 


405277 




Plus 


23471-23572 


405307 


3638954 


Plus 


39195-39429 


405311 


3638954 


Plus 


46313-46496 


405333 


3165399 


Plus 


149905-150215 




3451356 


Minus 


17503-17778,18021-18290 


405423 


4753276 


Plus 


5162-6983 




5801645 




31857-82045 


405501 


9211311 




49085-49400,49565-49679,50117-50262


405515 


9454624 




37329-37469 


405545 


1054740 


Plus 


118677-118807,119091-119296,121626-121823


405580 


4512267 


Plus 


169232-169547 


405536 


5002511 


Plus 


33810-39017 


405600 


5923640 


Plus 


25662-27225 


405610 


5757553 




71907-72080 


405539 


5091650 


Plus 


211184-211350 


405687 


6249668 




54787-54891,65844-55917 


405699 


4165331 


Plus 


100727-100859 


405783 


5738434 


Minus 


27238-27885 


405367 


6758731 




74553-75173 


406036 


7107817 


Plus 


9418-9573 


406124 


9149714 




1331-1774 


406130 


9161404 


Minus 


32394-32498 


406140


9168231 


Minus 


49887-60219 



406160 7144945
406207 5923550 
406215 7342161 



406388 9256205 

406457 9755793 

406473 9795566 

406537 7711478 

406571 7711622 

406580 7711838 



TABLE 4A lists about 131 genes up-regulated in ovarian cancer compared to normal ovaries that are likely to be extracellular or cell-surface proteins. These were selected as for
Table 3A, except that the ratio was greater than or equal to 10, and the predicted protein contained a PFAM domain that is indicative of extracellular localization.
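The selection rule stated above (expression ratio of at least 10, plus a PFAM domain indicative of extracellular localization) can be illustrated with a minimal sketch. Everything named here is an assumption for illustration only: the `Probeset` record, the `EXTRACELLULAR_PFAM` set (a small hypothetical subset of domain names that appear in Table 4A), and the sample rows in the usage note. The document does not give the actual domain list or the software used.

```python
from dataclasses import dataclass

# Hypothetical subset of PFAM domains treated as extracellular/cell-surface
# indicators. The real list used for Table 4A is not given in the text.
EXTRACELLULAR_PFAM = {"fn3", "ig", "EGF", "7tm_1", "cadherin", "Kunitz_BPTI"}


@dataclass
class Probeset:
    pkey: int      # probeset identifier (Pkey column)
    ratio: float   # tumor versus normal ovary expression ratio
    pfam: str      # semicolon-separated PFAM domain names


def is_candidate(probeset: Probeset, min_ratio: float = 10.0) -> bool:
    """True when the ratio threshold is met and at least one listed domain is present."""
    domains = {name for name in probeset.pfam.split(";") if name}
    return probeset.ratio >= min_ratio and bool(domains & EXTRACELLULAR_PFAM)


def select_candidates(probesets, min_ratio: float = 10.0):
    """Keep only probesets that satisfy both selection criteria."""
    return [p for p in probesets if is_candidate(p, min_ratio)]
```

For example, a row with ratio 14.8 and domains `fn3;pkinase;EPH_lbd` passes, while a row with ratio 9.9 or a row carrying only `pkinase` does not.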

TABLE 4A: ABOUT 131 UP-REGULATED GENES ENCODING EXTRACELLULAR/CELL SURFACE PROTEINS, OVARIAN CANCER VERSUS NORMAL OVARY

Pkey: Primekey
Ex. Accn: Exemplar Accession
UGID: UniGene ID
Title: UniGene Title
PFAM domains 



Pkey


Ex. Accn 


UGID 


Title 


PFAM 




403077 






predicted exon 


fn3 


15.0 


426535 


AU077012 


Hs.238582 


ESTs, Weakly similar to ubiquitous TP
predicted exon 


Kunitz_BPTI 


14.9


403089 






fn3


14.9 


457148 


AF091036 


Hs.184627 


KIAA0118 protein 
ESTs 


artjras 


14.8


431176 


AI026984


Hs.293662 


laminin_EGF;laminin_B;


14.8 


434293 


NM_004445 


Hs.3796 


EphB6 


fn3;pkinase;EPH_lbd


14.8 


408482 


NM_000676


Hs.45743 


adenosine A2b receptor 


7tm_1


14.6 


428695 


AI355647 


Hs.189999 


purinergic receptor (family A group 5)


7tm_1


14.5 


426125 


X87241 


Hs.166994 


FAT tumor suppressor (Drosophila) ho


EGF 




423732 


AF058056 


Hs.132183 


solute carrier family 16 (monocarboxy


sugar_tr;MCT


14.3 


422125 


NM_003459


Hs.111967


solute carrier family 30 (zinc transporte


Cation_efflux


14.2 


407483 


NM_012363 




(NONE) 


7tm_1 


14.2 


446639 


AW594695 




ESTs 


7tm_1


14.1 






Hs.59545 


ring finger protein 15 


zf-C3HC4;SPRY;zf-B_box 


14.0 


423217 


NM_000094


Hs.1640 


collagen, type VII, alpha 1 (epidermoly 




14.0 


405448 


AI015709 




Homo sapiens mRNA; cDNA DKFZpS 


trypsin;sushi;CUB


14.0 


450684 


AA872605 


Hs.25333


interleukin 1 receptor, type II


ig 


14.0 


406692 


L36607 




gb:Homo sapiens (clone 22) pregnancy


ig 


13.9 


42SS49 


U64863 


Hs.158297 


programmed cell death 1


■« 


13.8


452755 


AW138937 


Hs.213436 


ESTs 




13.8 


427637 


AK000816 


Hs.179986


flotillin 1


Band_7


13.7 


424591 


R55704 


Hs.150968


hypocretin (orexin) receptor 1


7tm_1


13.7 


406024 








TGF-beta;TGFb_propeptide


13.7








""rented r°n 








U84722


Hs.784 


Epstein-Barr virus induced gene 2 (lym


7tm_1


13.7 


420256 




Hs.76206 


cadherin 5, type 2, VE-cadherin (vascu 


cadherin;Cadherin_C_term


13.6 


420511 


AF052692 


Hs.98485 


gap junction protein, beta 4 (connexin 3 




13.5 


448638 


R17122 


Hs.21639 


nuclear protein, marker for differentiat


ig 




431117 


AF003S22 


Hs.250500 


delta (Drosophila)-like 1


EGF;DSL


m 


439285 


AL133916 


Hs.298998 


ESTs 


ig;pkinase;LRRNT;LRRCT


13.4


424283 


AA338246 


Hs.301678 


ESTs 


E1-E2_ATPase;Hydrolase


13.3




AI742878 


Hs.124116 


ESTs 


ig 


13.3


443859 


NM_013409 


Hs.9914 




kazal 


13.2


410016 


AA297977 


Hs.57907 


small inducible cytokine subfamily A (


IL8 


13.2


414020 


NM_002984


Hs.75703 


small inducible cytokine A4 (homologo


IL8 


13.2


400242 






predicted exon 


Ephrin


13.0 


429057 




Hs.194816 


stomatin-like protein 1 


Band_7;SCP2 


12.9 


438294 


A!69376f 


Hs.143004 


ESTs 


E1-E2_ATPase;Hydrolase


12.9 


458493 


AV649408 


Hs.282413 


ESTs 


RYDR_ITPR


12.8 


444181 


AB033063 


Hs.10491


KIAA1237 protein 


fn3;ig;PH;RhoGEF 


12.8 


422357 


AF016272 


Hs.115418


cadherin 16, KSP-cadherin 


cadherin 


12.7 


409632 






serine (or cysteine) proteinase inhibitor




12.7 




U12139 




gb:Human alpha1(XI) collagen (COL1


TSPN;Collagen;COLFI


12.6 


417064 


W02903 


Hs.15440 


ESTs 


lectin_c


12.6 


439389 


AA318940 


HS.56C04 


ESTs 


hemopexin;Peptidase_M10


12.6 


407786 


AAS87538 


Hs.38972 


tetraspan 1


transmembrane4


12.5 


410498 


AA355749 




gb:EST64459 Jurkat T-cells VI Homo


aa_permeases


12.5




AJ010901 


Hs.198267 


mucin 4, tracheobronchial 




12.5 


422330 


D30783 


Hs.115263


epiregulin


EGF 


12.5 


402425 








ion_trans


12.4 


414875 


H42679 


Hs.77522 


major histocompatibility complex, clas 


ig 


12.2 


424239 


M67439 


Hs.143526 


dopamine receptor D5


7tm_1


12.2 


442622 


NM_000435


Hs.8546 


Notch (Drosophila) homolog 3


EGF;ank;notch 


12.2 


405368 






predicted exon


7tm_1


12.2 


402406 






predicted exon 


Gal-bind_lectin


12.1 


426514 


BE616633 


Hs.301122 




TGF-beta;TGFb_propeptide


12.1 


406811 


U82979 


Hs.67846


leukocyte immunoglobulin-like recepto


ig 


12.0 


416441 


BE407197 




gb:601301552F1 NIH_MGC_21 Homo


SDF 


12.0 



433221 


AB040917 


Hs.97e60 


KIAA1484 protein 


442915 


AA852875 


Hs.8850 


a disintegrin and metalloproteinase dom


423S13 


AF036038 


Hs.129910 




411213 


AA676939


Hs.69285 


neuropilin 1


425483 


AF231022 


Hs.301273 


Homo sapiens protocadherin Fal2 (FA 


421258 


AA286731 




gb:zs53d08.r1 NO CGAP GCB1 Horn 


423795 


AW849759 




gb:IL3-CT021S-24020OO77-C04 CTO 


422424 


AI186431 


Hs.116577 


prostate differentiation factor 


443296 


AI755286 




gb:wi73b0&x1 Na_CGAP_Kid12Ho 


448999 


AF179274 


Hs.22791 


transmembrane protein with EGF-like


414878 




Hs.77541 


ADP-ribosylation factor 5 


429344 


R94038 


Hs.199538


inhibin, beta C


402114 






predicted exon 


419216 


AU076718 


Hs.164021


small inducible cytokine subfamily B (


430263 


D12614 


Hs.36 


lymphotoxin alpha (TNF superfamily, m


400464 








456341 


AA375863 


Hs.152345 


poliovirus receptor-related 1 (herpesvir


409420 


Z15008 


Hs.54451 


laminin, gamma 2 (nicein (100kD), kal


418043 


AW377752 


Hs.83341 


H.sapiens mRNA for tyrosine kinase re 


426523 


S68616 


Hs.170222


solute carrier family 9 (sodium/hydrog


446051 


BE048061 


Hs.153315 


ESTs 




AF086543 




gb:Homo sapiens full length insert cDN 


416602 


NM_006159 


Hs.79389 


nel (chicken)-like 2


418299 


AA279530 


Hs.83968 


integrin, beta 2 (antigen CD18 (p95), ly 


425721 


AC002115 


Hs.159309


uroplakin 1A


409757 


NM_001898 


Hs.123114 


cystatin SN


430630 


AW269920


Hs.2621 


cystatin A (stefin A)


429630 


M85239 


Hs.211573


heparan sulfate proteoglycan 2 (perleca 


427289 


AI097346 


Hs.174203 


solute carrier family 1 (glutamate/neutr 


401248 


AB028989


Hs.88500 


mitogen-activated protein kinase 8 inte


412627 


BE391959 


Hs.74276 


chloride intracellular channel 1 


420104 


U09825 


Hs.1287 


zinc finger protein 173


405275 


AB028989 


Hs.88500 


mitogen-activated protein kinase 8 inte


425864 


U56420 


Hs.159903 


olfactory receptor, family 5, subfamily 


446745 


AW118189


Hs.156400


ESTs 


441834 


AL138034


Hs.7979 


KIAA0736 gene product 


450986 


BE241345 


Hs.25744 


Novel human gene mapping to chomcs 


416118 


N52773 


Hs.167721 


ESTs 


443071 


AL030021 


Hs.8986 


complement component 1, q subcompo 


431247 


AL021578 


Hs.278489 




431449 


M55994


Hs.256273 


tumor necrosis factor receptor superfam 


457044 


S73399 


Hs.2131 


arginine vasopressin receptor 1A 


416319 


AI815601 


Hs.79197 


CD83 antigen (activated B lymphocyte 


402172 






predicted exon 


424218 


AF031324 


Hs.143212 


cystatin F (leukocystatin)


409208 


Y00093 


Hs.61077 


integrin, alpha X (antigen CD11C (p15


426330 


M77235 


Hs.169331 


sodium channel, voltage-gated, type V, 


439758 


AA845235 


Hs.124470 


ESTs 


412429 


AV650262 


Hs.75765 


GRO2 oncogene


449987 


AW079749 


Hs.184719 


ESTs, Weakly similar to AF116721 11


432408 


N39127 


Hs.76391 


myxovirus (influenza) resistance 1, hom


406672 


M26041 


Hs.198253


major histocompatibility complex, clas


419749 


X73608 


Hs.93029


sparc/osteonectin, cwcv and kazal-like


419086 


NM_000216 


Hs.89591 


Kallmann syndrome 1 sequence 


425C09 




Hs.154151 


protein tyrosine phosphatase, receptor t


423869 


BE409301 


Hs.134012 


C1q-related factor


430209 


AF177941 


Hs.235368 


Pro.{alplia)3(V) collagen 


400834 






predicted exon 


442941 


AU076728 


Hs.8867 


cysteine-rich, angiogenic inducer, 61


403691 






predicted exon 




AJ011021 


Hs.247905 


potassium voltage-gated channel, subfa


432342 


AL036128 


Hs.274404 


plasminogen activator, tissue


413731 


BE243845 


Hs.76511


connective tissue growth factor


423309 


BE006775 


Hs.126782


sushi-repeat protein 


431728 


NM_007351 


Hs.268107 




450245 


AA007536 


Hs.271767


ESTs, Moderately similar to ALU1_HU 


446983 




Hs.97199


complement component C1q receptor






Hs.75893 


ankyrin 3, node of Ranvier (ankyrin G)


400253 






predicted exon 


406694 


M94891 


Hs.225932 


pregnancy specific beta-1-glycoprotein


418793 


AW382987 


Hs.88474 


prostaglandin-endoperoxide synthase 1


410664 


NM_006033


Hs.65370 


lipase, endothelial 


427274 


NM_005211 


Hs.174142 


colony stimulating factor 1 receptor, fb 



fn3;ig;LRRCT

dlsWegriniRepro 

ig;Sema;Acetyltransf
CUB;MAM;F5_F8_type_C 
EGF;cadl ■ ■ • ■ - 
7tm_3



TABLE 4B: 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

410498 120611_1 AA355749 AA085520 AW966333 AA340319 BE170936

416441 1594B0_1 BE407197 AA182474 AA180369 BE275628 BE276131

421258 200725_1 AA286731 AA287621 AW188228 AW137774

423795 232093_1 AW849759 AW849758 T89549 AA331069 



7tm_1

PH;RhoGAP;Gal-bind_lectin
hemoRexln;Peptli- ••- 
C1q;Collagen
EGF;vwa
TNFR_c6
7tm_1 



ABC_tran;ABC_membrane 
ion_trans;K_tetra
ig;MHC_II_alpha



fn3;ig;Y_phosphatase;MAM
GTP_EFTU;EFG_C 
Collagen;COLFI;TSPN 
IRK 

Cys_knot;tsp_1;vwc;IGFBP
Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons
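As an illustration of how an Nt_position entry can be read, the following minimal sketch splits a field such as `72969-73050,73713-73800` into exon coordinate pairs and sums their lengths. The function names are hypothetical, and the inclusive-coordinate convention (length = end - start + 1) is an assumption; the document does not state how the positions are counted. Fields damaged in extraction are rejected rather than guessed at.

```python
def parse_nt_positions(field: str):
    """Split a comma-separated list of start-end ranges into (start, end) tuples."""
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, sep, end_text = part.partition("-")
        # Reject anything that is not two plain integers joined by a hyphen.
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            raise ValueError(f"unreadable exon range: {part!r}")
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"exon end precedes start: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons) -> int:
    """Sum exon lengths, assuming both coordinates are inclusive."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_nt_positions("72969-73050,73713-73800")
print(exons)                     # [(72969, 73050), (73713, 73800)]
print(total_exon_length(exons))  # 170
```

A range that fails to parse signals an entry that needs checking against the original publication instead of a silent correction.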



400464 
400834 
402114 
402172 



8575911 
3970929 
9796347 
8954241 
8954241 
7387384 
7107727 
6139075 
2104517 



121963-122288 
71578-71715 
143378-143671 
10872-11123,12932-130 



8850048697 

55744-55903.57080-57170,6147841560 
46055-47188 



TABLE 5A: 685 DOWN-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL OVARY

Pkey: Primekey

Ex.Accn: Exemplar Accession 

UGID: UniGenelD 

Title: UniGene Title 

ratio: ratio normal ovary vs tumor



421013 M62397 



455056 AW853057 

420727 

461617 
401308 



459054 
435330 
436642 



Hs.55346 
Hs.37288 
Hs.119059



AA911705 
T40760 
BE314524 
AA804362 
M908263 
AW798466 
R16769 
AA724430 
All 20800 
AI808964 



436812 AW298067 



412637 
419166 
423739 
413813 
416211 
443131 
415866 
410130 



Hs.130229 
Hs.90469
Hs.78776 

Hs.291625 



ESTs, Weakly similar to Z141_HUMAN ZINC FINGE



2',5'-oligoadenylate synthetase 1



Hs.127960 ESTs



gb:DKFZp762E15^1 762 (synonym: hmel2) Homo sa 

gb°u|l||-'m^|M^-09*Ul.s1 NCI_CGAP_Sub6 Hera 
gb:601062418F1 NIftMGCJO Homo sapiens cDNA 



R14625 
AI033833 
T10115 
AI912097 
AI131502 
408141 U69205 
419015 T79252 
441573 BE56396S Hs.6529 
419386 AA236867 Hs.143868 ESTs 



Hs.293584 
Hs.97600 
Hs.76661 

Hs.132689 
Hs.92423 
Hs.163208
Hs.143135 
Hs.45152 
Hs.14463 



gb:yg45c03j1 Soares infant brain 1NIB Homo sapiens



Z43123 
AW993300 
AW118681
N67659 



421418 
413597 

454102 AW752363 
445487 AI806287 
457804 AI004397 



AW302885 Hs.117183 ESTs 



gb:RC2-BN0033-180200-015-g06BN0033Homosapie 
ESTs 

ESTs, Weakly similar to neuronal thread protein AD7c- 
gb:Ph&HT0344-071299-003<K)8 HT0344 Homo saple 
gb:601117374F1 NIHJIGC 16 Homo sapiens cDNA 
gb:ob88g0S.s1 NCI_CGAP..GCB1 Homo sapiens cDN 



gb:RCO-Cn)201-270999-011403 CT0201 Homo sapien 
ESTs 

ESTs, Weakly similar to similar to O-sialoglycoprotein
422046
441284 
446224 






AA927676 Hs.196542 

AW450551 Hs.13308 

AU077260 Hs.153924 

AW009077 Hs.232947 

AA167642 Hs.14632 



431877 
411337 
410596 
417762 
406364 
452238 

415288 R15794 
407437 AF220264 
439126 AF0a5984 
452453 
431800 
426380 
449529 
437755 
448307 
439586 
420051 
425806 
433923 
408159 
434844 

430197 AA46BBBB 
440332 AI218517 
450061 AI797034 
454994 AWB50176 
402105 

409090 W56067 

408074 R20723 

459200 Y09306 

416310 T81421 

421976 AL138443 

429755 NM_001364 

448732 BE614063 



W01715 
AK000043 
AA278781 



416453 H56968 



410495 
403277 
444302 



Hs.124764
Hs.30148
Hs.221396



Hs.162045
Hs.149990
Hs.232033
Hs.291887
Hs.211026
Hs.110039
Hs.44745
Hs.173369
Hs.146626
Hs.118526
HSJ01356 



gb:QV2-LT0038-27(1300-108^j12 LT003B Homo saple 
gb:EST86290 HSC172 cells I Homo sapiens cDNA 5 e
gb:zq48a10.r1 Stratagene hNT neuron (937233) Homo 
predicted exon 

ESTs, Moderately similar to S22703 voltage-gated pota
ESTs, Weakly similar to ALU1_HUMAN ALU SUBFA 
gb:Homo sapiens M08T-1 mRNA, complete cds. 
gb:Homo sapiens full length insert cDNA clone YT99F
gb:QV-BT009-1011984)51 BT009 Homo sapiens cDNA 
ESTs

ESTs, Weakly similar to unnamed protein product [H.sa



hypothetical protein LOC56757

ESTs, Weakly similar to ALUSJUMAN ALU SL 



gb:IL3-CT021»-2710994)22-HI)4 CT0219 Homo sapien 



Hs.103105 ESTs



Hs.280698 ESTs 



homeodomain-interacting protein kinase 3

ESTs 

mRNA for FLJ00023 protein

discs, large (Drosophila) homolog 2 (chapsyn-110)

gb:601503993F1 NIH_MGC_71 Homo sapiens cDNA

ESTs 

Homo sapiens cDNA: FLJ21440 fis, clone COL04389
Homo sapiens cDNA FLJ12977 fis, clone NT2RP20062
ESTs, Weakly similar to Lpg6p [S.cerevisiae]
hypothetical protein FLJ20036



gb:CM0-Hn)143-270999462-d12 HT0143 Homo sapi 
ESTs 

DKFZP547G0910 protein 



Hs.87517
Hs.112692
Hs.8850
Hs.119813



a disintegrin and metalloproteinase domain 12 (meltrin a



pleckstrin homology, Sec7 and coiled/coil domains 1 (cy



8-oxoguanine DNA glycosylase



predicted exon 
histone deacetylase 5 

ESTs, Highly similar to FXD3_HUMAN FORKHEAD
ESTs 

gb:qz30a07jc1 NCLCGAP_Kid11 Homo sapiens cDNA 

ESTs 

ESTs, Weakly similar to ALUB_HUMAN ALU SUBFA



homology 3 domain-containing protein HIP-55
452351 AA025547



431888 H99557 



446361 
458253 
433682 
455790 

436513 

440231 



Hs.44553 
Hs.277215 
Hs.2864 
Hs.26418 
R83017 Hs.204828 
H16442 Hs.127376 
AF0B5922 Hs.113968 
AI291234 Hs.282241
AW296gS2 Hs.196802 
AA642418 Hs.17381 



AW294e70 

AJ278110 

N94087 

AW015420 

AA460104 

AA345232 



gb:ze85d01.r1 Soares_fetal_heart_NbHH19W Homo sa
gb:za40a06.r1 Soares fetal liver spleen 1NFLS Homo sa

unc5 (C.elegans homolog) c
ESTs

early endosome antigen 1, 162kD



KIAA0266 gene product 



Hs.223672 
Hs.125507
Hs.26073 
Hs.163323 
Hs.99540 
Hs.21227 
Hs.137579 



gb.'RC1-BTO72O-28030&O1 1-g02 BT0720 Homo saple 



410454 
438116 
409138 
423047 
440212 
404108 
456253 
409365 
444013 
454071 
419761 



444131 
441679 
450077 
421209 
445140 
421126 
447037 
407168 
436196 
442772 
444138 
453589 
451640 
441318 
407490 
438224 
451638 
457356 



401287 
424188 

404868 



AW749041 
AI904106 
W73159 



AI041793 
M17373 
AA4gi275 



M74587 

AI357668 

R46175 

AK001084 

AW503680 

AI701572 

AV654623 



HS.22644C 
Hs.44404 
Hs.42502 
Hs.93177 
Hs.236940 



gb:RC3^T0319-100100-012-c05 BT0319 Homo sa| 



gb:A588F Heart Homo sapiens cDNA clone A588, mRN
Homo sapiens clone 24881 mRNA sequence
■- 1nPR01488 



Homo sapiens cDNA FLJ12542 fis, clone NT2RM4000
predicted exon 

gb:PM2-UM0027-230200-002-h02 UM0027 Homo sap
EST, Weakly similar to intrinsic factor-B12 receptor pr 



Hs.120855
Hs.102576
Hs.197913
Hs.102122
Hs.157612



Hs.151153 ESTs 



gb:yg40f01.s1 Soares infant brain 1NIB Homo sapiens
gb:Homo sapiens cDNA FLJ10222 fis, clone HEMBB1
ESTs, Weakly similar to T1SB7.2 [C.elegans]



Hs.176130 ESTs 



Hs.82396 
Hs.191670
Hs.22801 
Hs.145328 
Hs.226410 
Hs.291453 



AW954552 Hs.142634 zinc finger protein



gb:pancreatic ribonuclease [human, mRNA Recombinan
gb:on91f04.s1 Soares_NFL_T_GBC_S1 Homo sapiens
2',5'-oligoadenylate synthetase 1



ESTs, Weakly similar to ALU1_HUMAN ALU SUBFA



Hs.18498
Hs.246872 
Hs.132458 



AW593104 

AI204451

BE267477 

NM_007374 

BE386606 

BE391510 

AW515763 

AA649253 

AF040247 

AW846202 

AW991925 

N48990 



AI734233 Hs.226142 

NM_005036 Hs.993

AA812633 Hs.10845 

AI151030 Hs.146830 

AW973801 Hs.134656

AA813982 Hs.291842 



gb:601189542F2 NIH_MGC_7 Homo sapiens cDNA cl



Homo sapiens erythroid differentiation-related factor
gb:Qy0-CT0179-ai1299^)61-f10 CT0179 Homo sapie 
gb:PM3-BN001 1-13010O<)02-b07 BN0011 Homo sapi 



449862 AI672277 
441133
446323 
440347 
439481 



403531 
429773 
422563 



433371 
410667 
410890 



T78S25 Hs.268694 

AL157686 Hs.293737

H18252 Hs.227263 

AA918191 Hs.194457

AI288274 Hs.149868 

AI125590 HS.142S64 

AF086294 Hs.125844 
W28557 

R34177 Hs.181315

R93483 Hs.260273 

AI332482 Hs.218791

BE299342 Hs.19348

Z437B4 Hs.78713 

AL048807 Hs.180714

N72050 Hs.164144

F12B93 Hs.13301 
T25451 
AW936099 
AW809575 



441705 AI087052 



423073 BE252922 



424719 
439542 
444433 
438831 



401004 
425837 
420497 

429409 
447959 
407340 



414475 
432075 
417906 
406518 
441460 



Hs.123119

BE300369 Hs.42643 
H90462 

AW297571 Hs.17646 

AV649844 Hs.282436 

BE263273 Hs.301128 
AW812744 

AA039843 Hs.61948

AI807339 Hs.152174 
AA325483 
AW813110 

AI221631 Hs.166788 



403535 
441919 
416717 



NM_014517 Hs.28423
BE0S9037 

AA355288 Hs.271408 



predicted exon 

proteoglycan 4, (megakaryocyte stimulating factor, arti
Homo sapiens cDNA FLJ13119 fis, clone NT2RP30026



cytochrome c oxidase subunit VIa polypeptide 1



gb:PTHI188 HTCDL1 Homo sapiens cDNA 5IS simila 
gb:QVO-DT0020-210100^)95^l04 DT0020 Homo sapie 
gb:MR4-ST0121-Q5020D^-a12ST0121Homosapie 
predicted exon 



AA927317 Hs.176719 

AI594817 Hs.155980 

AI452784 Hs.270270 

AA810168 Hs.232119 

NM_014479 Hs.145296 

AF027219 Hs.9443 

T76167 Hs.285516 

BE302955 Hs.119598
AW972934 

R24769 Hs.23725 

W28077 Hs.79389 

AI962478 Hs.226804

T49427 Hs.181244 
AA380828 
AW968485 

BE047329 Hs.144483

AI557139 Hs.129179 
N50617 

AW593050 Hs.128580 
AF003834 

AW448914 Hs.202391 

AI657038 Hs.196109

H69125 Hs.133525 

AW504860 Hs.288836 

AA320749 Hs.209464 

AW079984 Hs.262480 

N62937 Hs.269109 

AL110237 Hs.28425



MAD (mothers ag
ESTs, Weakly similar to KIAA1016 protein [H.sapiens
gb:yvO1c03j1 Soares fetal liver spleen 1NFLS Homo sa



gb:RC3-STD186-181099^2.c0g 570186 Homo sapien 
ESTs 

ESTs, Weakly similar to Z140_HUMAN ZINC FINGE
gb:EST28475 Cerebellum II Homo sapiens cDNA S en 
gb:CM4ST0189 051099-021-f05 ST0189 Homo sapien 



Insulin receptor substrate 4 



ESTs 
ESTs 

disintegrin protease
zinc finger protein 202

ESTs, Weakly similar to hypothetical protein [H.sapien
ribosomal protein L3 

gb:EST385030 MAGE resequences, MAGM Homo sap 
ESTs 

nel (chicken)-like 2

ESTs, Moderately similar to ALUC_HUMAN !!!! ALU
major histocompatibility complex, class I, A
gb:EST93827 Activated T-cells Vil Homo sapiens cDN 



gb:AF003834 aontedi HI1149x Homo sapiens cDNA 



Homo sapiens mRNA; cDNA DKFZp566D224 (from c



H79559 Hs.297726 ESTs 

NM_005092 Hs.243197 tumorne 

AA449013 Hs.99203 ESTs 

R15955 Hs.21753 ESTs 

AA017066 Hs.237636 EST 
AI221511 



hypothetical protein SP192
upstream binding protein 1 (LBP-la) 
gb:QV3-BT0379-161299-040H!l2BT0379Homosapie 
ESTs 

gb:DKFZp761D2315j1 761 (synonym: hamy2) Homo



AV650231 Hs.282941 






Hs.269347 ESTs 



AW419296 
AI890955
AW88114S 
AF103907 



448313 
411279 
440652 
416608 
420405 
405717 
435267 



Hs.262983 ESTs 



hypothetical protein FLJ20093



Hs.143977 ESTs



432474 
455388 
426456 



436404 
457740 
437385 






AA584042 
AW936234 
AA58C748 



446190 
443542 
430444 
454573 
409346 
456141
456140 
441635 
416677 
401740 



411922 
452320 
431644 
409892 
418132 
414372 
400196 



AW500458 
AA757055 
AV650124 
AA046914 
AWB34039 
BE548749 
BE500941 

AI279299 
AI927065 
AW295421 
BE146471 
AW501748 
A17513S7 
AA169516 
AI459261 
T83470 

AA255714 

AW272457 

AA37418S 

AW876260 

AA042873 

AW972822 

AW956113 

T92670 

AA143654 

MS9964 

AA380876 

N39015 

AW46r?91 

AA808136 

AA806607 

AA001308 



453530 
415553 
416874 
454885 
419896 
440962 
419401 



gb.'CM-BTdS4W399-054 BT054 Homo sapiens cDN 
gb:nn6Se09.s1 Na_CGAP_Lar1 Homo sapiens cDNA 
gb:QVO-DT0020-090200-105^05 DT0020 Homosaple 



small nuclear ribonucleoprotein polypeptide G 
gb:601159362F2 NIH_MGC_53 Homo sapiens cDNA
Homo sapiens mRNA for partial 3'UTR, sequence 2
gb:Ui-HF-BN0-al<b^l-O7-O-Ui.r1 NIH_MGC_50 Homo 



Hs.282435 ESTs 



AW021633 
AA885143 
H98752 



Hs.1048 
Hs.270 

Hs.190368
Hs.155561
Hs.177698 
Hs.292206 
Hs.193213 



gb:QVO-HT021641 11994)43409 HT0216Homosapie 
gb:UI-HF-BR0p-a|m-b-12-(HJI j1 NIHJiIGC J2 Hom 
Homo sapiens cDI^A: FU222S6 Us, clone HRC02860 



gb:yd46g06.r1 Soaresfey llverspleen INI^ Homos 

predicted exon 

Fanconi anemia, complementation group A 



gb:PM4-PT0019-131299-006-E04 FT 
ESTs 

gb:EST3S8183 MAGE resequences, MAGD Homo sap 
ESTs 

gb:zo6Sa02j1 Slratagene pancreas (S3720B) Homo sap 

predicted exon 

KIT ligand

pleckstrin homology, Sec7 and coiled/coil domains, bind



Hs.125719 ESTs 



gb:601342622F1 NIHJilGCJS Homo sapiens cDNA 
predicted exon 

gb:df26cOZy1 Morton Fetal Cochlea Homo sapiens cDN 



gb-.QV1-LT0036-15020(M)74-h06 LT0036 Honwsapie 
gb:HSZ99362 DKFZphamyi Homo sapiens c(]N A den 
ESTs, Moderately similar to A Chain A, Secypa CompI 
gb:QV4-UMC094-16030O-135^K)6 UM0094 Homo sap 



AW161533 
AJ001865 
AI148037 



Hs.32703
Hs.41371 
Hs.40400 
Hs,28S711 
Hs.1165 
Hs.300866 






444551 W58459
455067 AW854533 
442657 BE502631 
AA835639 
AI379772 



Hs.103120 ESTs 



429142 
429274 
437774 
427737 






441951 
440706 
434231 

411039 



443601 
404041 
406122 



Hs.130645 ESTs 



gb:RC3-Cn)255-200100O24-b02 CT0255 Homo sapie 



AA43S988 

BE182082 

R37529 
H70565 
AW967472 
AA035197 
AA747549 
AW450202 



ESTs, Weakly similar to AF068289 5 HDCME31 P [H.s



Hs.246973 ESTs 



Hs.107375
Hs.259122
Hs.122963
Hs.254602 



431828 
438872 
438673 
416624 
401963 
402867 
408315 
418320 
447199 



ESTs 

gb:PM1-HT0345-121199-001-c08 KT0345 Homo sapie 
AvvDi-!^ gb:hg97c10.x1 NCLCGAP_Kld11 Homo sapiens cDN 

AW963969 gb:EST375042 MAGE resequences, MAGH Homo sap

NM_002410 Hs.121502 mannosyl (alpha-1,6-)-glycoprotein beta-1,6-N-acetyl-g
AA572994 gb:tim33f1is1 NCLCGAP Up2 Homo sapiens cDNA 

R64197 Hs.23589 ESTs 
Aia24717 Hs.123443 ESTs 

H69044 gb:yr77l)05j1 Soares fetal liver spleen 1NFLS Homo sa



AW514021 

Ai/V9S3444 

AIJ046575 

W31002 

AA904244 

AF119901



Hs.84084 
Hs.160900 
Hs.193945 



AA309234 
AV652707 
AL040421 
AI078554 



Homo sapiens cDNA FLJ13962 fis, clone Y79AA10012



Hs.250568 hypothetical protein PRO2831



Hs.163348 ESTs



gb:PMO-BT0340-231 199-001-1)07 BT0340 Homo sapie 
Homo sapiens cDNA: FLJ23119 fis, clone LNG07978
Homo sapiens mRNA full length insert cDNA clone EU
gb:DKF2p434B0714_r1 434 (synonym: htesS) Homos 



411899 
426758 
421776 
430169 



439328 W07411 
429066 AAS68S55 



424916 
424769 
400080 
421521 



441392 
424025 
448527 



AA370573 

AW301994 Hs.108183 
AA468531 Hs.189047 
AI808007 Hs.66450 
AW807573 

Hs.161640 
Hs.118212 
Hs.178222 
AI94S490 Hs.98765 
AA837146 Hs.180275 
AI580083 Hs.176154 
AW615224 Hs.252839 
AA825721 Hs.246973 
NO01874 Hs.1 69765 
AW867440 Hs.23096 



AI275715 
AW451831 
AI701852 
AI525606 
AA351109 
AI673006 
AI657014 



423209 
409854 
414941 
456337 
415296 
423338 
41561 B 



AW501833 
C14865 
AW7S1661 



ite tumor suppressor p33 ING1 homolog 



gb:MR1-ST0088-0212g94X)4^01 ST0088 Homo sapie
tyrosine aminotransferase 

ESTs, Moderately sWar to ALUa.HUMAN ALU SU 



ls.2221 1 9 ESTs, Weakly simflar to K1CCLHUMAN KERATIN, T 
Hs.301296 ESTs

gb:Fri J_03_G05.r tumorl Homo sapiens cDNA 5, mR 
Hs.5437 Tax1 (human T-cell leukemia virus type I) binding prot
ls.231 948 ESTs, Wealdy similar to ALUB_HUMAN !!!! AIOJ a 

gb-.tt49a12.);1 t>ICl„CGAP_GC5 Homo sapiens cDNAc 
Hs.132395 ESTs

gb:zh78c08.s1 Soare3jetaUiver_spleen_1Na8_S1 H 
Hs.106823 H.sapiens gene from PAC 42616, similar to syntaxin 7
gb:UI-HF-BR0p-aKMj-01-O-Ul.r1 N|H_MGC_52 Horn 
fs.1821S9 ESTs 
Hs.65919 ESTs



gb:HSC01A011 normalized infant brain cDNA Homo s
ftDuuraoi ns.i^rj« KIAA0492 protein 
F129S4 gb:HSC3GG091 no 

predicted exon 

AF217509 Hs.283077 centrosomal P4.1-associated protein; uncharacterized bo







450867 


AA011454 


Hs.246122 


ESTs 


3.5 


431339 


AA506294 


Hs.267266 


ESTs 


3.5 


441969 


AI733386 


Hs.129194 


ESTs, Weakly similar to ALU1_HUMAN ALU SUBFA


3.5 




AW970603 


Hs.300941 


Homo sapiens cDNA FLJ11661 fis, clone HEMBA100


3.5 


434317 


AI674095


Hs.116323


ESTs 


3.5 


414741 


RS1321 


Hs.25780 


Homo sapiens cDNA IU122S2fis. clone MAMMA10 


3.5 


439707 


AW297702 


Hs.102915 


ESTs 


3.5 


443178 


AI631241 


Hs.47312 


ESTs 


3.5 


400397 


AJ270770 


Hs.154485 


transcription factor 7-like 2 (T-cell specific, HMG-box)


3.5 


455887 


BE154173 




gb:PM1-HT034O-20129MO4.f12 HT0340 Homo sapie 


3.5 


434362 


W27081 


Hs.295446 


ESTs 


3.5 


409211 


AA078835 




gb:zm94li04.s1 Stratagene colon HT29 (937221) Homo 


3.5 


414390 


BE281040 




gb:601156234F1 NIH_MGC_21 Homo sapiens cDNA 


3.5 


457142 


AI924353 


Hs.290969 


EST 


3.5 


423006 


U29700 


Hs.123014


anti-Mullerian hormone receptor, type II


3.5 


453363 


AI989776 


Hs.232623 


ESTs 


3.5 


418913 


BE046745 




gb:hn39b06.x1 NCI_CGAP_RDF2 Homo sapiens cDN


3.4 


440016 


AW118114 


Hs.137057


ESTs 




405096 










435072 


AW592176 


Hs.116932


ESTs 


3.4 


438535 


L09078 




gb:Homo sapiens mRNA fragment


3.4 


424001 


W67883 


Hs.137476 






428361 


NM_015905


Hs.183858 


transcriptional intermediary factor 1


3.4 


410587 


AA370706 


Hs.11252 


ESTs, Weakly similar to Weak similarity with the Ysy6


3.4 


454543 


AW806895 




gb:QV4-ST0S23-1 6040O-172-CO6 ST0023 Homo sapien 


3.4 


419515 


881944 


Hs.90791 


gamma-aminobutyric acid (GABA) A receptor, alpha 6 


3.4 


410280 


AA083558 


Hs.261286 


ESTs 


3.4 


425714 


AW963278 




gb;EST375351 MAGE resequences, MAGH Homo sap 




416895 


AW961600 




gb:EST373672 MAGE resequences, MAGG Homo sap 




427935 


AW503687 


Hs,1 19424 


ESTs, Weakly similar to unnamed protein product (H.sa 




411673 


BE064863 




gb;RC1-BT0313-110300^)15-f06BT0313 Homo sapien 


3.4 


453339 


AW992599 


Hs.252797 


ESTs 


3.4 


424696 


BE43g547 


Hs,15ig03 


Homo sapiens clone 24706 mRNA sequence


3,4 


436242 


AK002187 




gb:Homo sapiens cDNA FLJ11325 fis, clone PLACE10


a4 


442837 


AI022082 


Hs.50492 


ESTs 




452807 


AA028g33 


Hs.162434 


ESTs 


3,4 


418110 


R43523 


Hs .217754 


Homo sapiens cDNA: FLJ22202 fis, clone HRC01333


3.4 


433936 


AI208072 


Hs.123459 


ESTs 


3.4 


458177 


AI744995 


Hs.267072 


ESTs, Moderately similar to ALU4_HUMAN ALU SU


3,4 


401896 










406237 






predicted exon 


3,4 


457688 


AL110157 


Hs,3843 


Homo sapiens mRNA; cDNA DKFZp586F2224 (from 


3,4 




AW363582 


Hs.75323 


prohibitin


3,4 


421916 


R34441 


Hs.101007 


Homo sapiens cDNA: FLJ23546 fis, clone LNG08361


14 


419321 


N48146 


Hs.269069 


ESTs ^ 


3,4 




AV654978 








406197 






predicted exon 


3]4 


443005 


A1027184 


Hs.200918 


ESTs 


3.4 


450078 


AI681743 




gb;U33g10j(1 NCI.CGAP_Ui24 Homo sapiens cDNA 




431301 


AA502384 


Hs.151529 


ESTs 


3.4 


430202 


T85775 




gb:yd60g02r1 Scares fetal liver spleen 1NFLS Homo s 




423559 


H24338 




ESTs 




455731 


BE072188 




gb:QV4-BT0536^112994S5-b09 BT0536 Homo sapie 


a4 


420735 


AW297440 


HS.B3653 


ESTs 


3.4 


405836 


NM.000B09 


Hs.248112 


gamma-aminobutyric acid (GABA) A receptor, alpha 4


3.3 
3.3 


449178 


AI633748 


Hs.1 97597 


predicted exon 
ESTs 


3.3 


453265 


U61232 


Hs.32675 


tubulin-specific chaperone e


3.3 


430700 


AA768902 


Hs.247812 


H2A histone family, member K pseudogene 


a3 


424496 


AI733451 


Hs.129212 


ESTs 


3.3 


446963 


AI862668 


Hs.176333 


ESTs 


13 


422879 


AI241409 


Hs.1 88092 


ESTs 


3.3 




AW448930 


Hs.5415 


ESTs 


3.3 


449570 
406255 


AA001793 




gb.'zli86c06.r1 SoatBS_letaUiver_spleen.1NFLS_S1 H 


3.3 


41 23.1 


AW936903 




predicted exon 

gb:RC1-DT0029^020t012-d02 DT0029 Homo sapie 


3.3 


401350 






predicted exon 


3.3 


439093 


AF085955 




gb:Homo sapiens full length insert cDNA clone YR86G


3.3 


450539 


AI701505 


Hs.202526 


ESTs 


3.3 


430749 


AJ242956 


Hs.25960 


v-myc avian myelocytomatosis viral related oncogene, n


3.3 


430539 


AI695595 


Hs.293219 


ESTs 


3.3 


454753 


AW819212 




gb:CM1-ST0283J71299O61-c07 ST0283 Homo sapie 


3.3 




AA1 94980 


HS.30B13 


Homo sapiens cDNA FU13681 fis, done PLACE20000 


3.3 


413516 


BE145907 




gb:MRO-HT0208-221299-204^12 HT0208 Homo sapie 


3.3 












457107 


AA413246 


HS.1B5796 


EsfrWelwy sWIa'to b34iaTS 


13 


421480 


NM_016158 


Hs.104671 


erythrocyte transmembrane protein 


3.3 


444289 


BE267060 


Hs.76391 


myxovirus (influenza) resistance 1, homolog of murine


3.3 


417725 


R25257 


Hs.21503 


ESTs 


3.3 


453631 


AL046418 




gb:DKFZp434l4247_n 434 (synonym: Iites3) Homo sa 


3.3 


450692 


H50603 


Hs.94037 


tiypothetical protein FLJ230S3 


3.3 


413357 


W47611 




gb2c35e06.r1 Soares_5enescentJbroblaste_NbHSFH 


3.3 


415327 


H22769 


Hs.1861 


membrane protein, palmitoylated 1 (55kD)


3.3 


457569 


AW970021 


Hs.291120 


ESTs, Weakly similar to ALU8_HUMAN ALU SUBFA


3.3






Homo sapiens cDNA: FLJ21685 fis, clone 00109372
Homo sapiens mRNA; cDNA DKFZp43401672 (from
cold shock domain protein A 



Homo sapiens TNF receptor homolog mRNA, partial cd
ESTs 

gb:EST39621 Epididymus Homo sapiens cDNA 5 end, 




gb:ym26c07j1 Soares infant brain 1NIB Homo sapiens




il04134 








3.1 






Hs.126895 




3.1 










3.1 


411876 


AW961336 


Hs.69705 


ESTs, Weakly similar to KIAA0443 [H.sapiens]




451048 


AA013349 


H3.60602 


ESTs


11 




AI356564 




ESTs 




404083 






predicted exon 


3.0 


415833 


H05175 


Hs.107510 


ESTs 


3.0 


402142 






predicted exon 


3.0 


415820 


R53720 

AW016534 

AA001278 


Hs.1 89745 
Hs.226g94 
Hs.59905 


ESTs 
ESTs 
ESTs 


3.0 

3^0 


457593 


AI738815 


Hs.1 17323 


ESTs 


3.0 




AW850767 




gb:lL3-CT0220-031 199^)25^05 CT0220 Homo sapien 
predicted exon 


3.0 


449561 


AI022240 


Hs.17924 


ESTs 












3.0 


420306 


AA258318 


Hs.219226 


ESTs 




BE068130 




gb:CM2-BT0368-171299-056-a01 BT0368 Homo sapie 




448221 
415920 


BE622615 




gb:601440775T1 NIH MGC 72 Homo sapiens cONA 
gb:HSCZRD121 normalized infant brain ctm Homo 


3^0 
3.0 


425357 


AI902802 
AA3SS842 




gb:RC-BT015-311298-026 BT015 Homo sapiens cDNA 
gb:EST64303 JurkalT^»lls VI Homo sapiens cDNA S 




454724 


AA091228 




gb:cchn21525eq.F Human fetal heart Lambda ZAP Ex 


3.0 


429395 


AK002071 


Hs.201624 


hypothetical protein FLJ11209


3.0 


427607 


AA4C6119 


Hs.270479 


ESTs 


3.0 


443598 


AW499970 


Hs.14822 


ESTs 


3.0 


437948 


AA772920 




gb'.ae73c09.e1 StraUgene schizo brain 811 Homo sapien 


3.0 


418105 


AW937488 


Hs.178000 


ESTs 


ao 


426763 


AL042262 


Hs.172101 


Human DNA sequence from clone RP1-202I21 on chro


3.0 


403473 






predicted exon 


3.0 


427501 


AI369280 


Hs.131743 


ESTs 


3.0 




NM_000933 


Hs.32539 


KIAA1264 protein 


3.0 








homeo box (H6 family) 1


3.0 


433964 


AW241987 


Hs.197025 


ESTs 


3.0 




AL03792S 




gb:DKFZp564M037j1 564 (synonym: hlbr2) Homo sa 


3.0 


433183 
435399 


AF231338 
W89093 


Hs.222024 


transcription factor BMAL2 
ESTs 


3.0 
3.0 


425626 
428931 
426693 


AI537536 
AA994979 
AW968560 


Hs.'l 73519 
Hs.98967 


ESTs 

ATPase, H+ transporting, lysosomal, noncatalytic acc
gb:EST370630 MAGE resequences, MAGE Homo sapi


3.0 

3.0 


431899 


AA521381 


Hs.187726 


ESTs 


3.0 


422406 


AF025441 




Opa-interacting protein 5 


3.0 


448178 


AI479482 


Hs.170789 


ESTs 


3.0 


404227 






predicted exon 


3.0 


440575 


AA889870 


Hs.126006 


ESTs 


3.0 


431193 


AL047634 


Hs.231913 


ESTs 


3.0 


434221 
459459 


AF119885 


Hs.283040 


hypothetical protein PR02543 
gb:zx66h11.r1Soares_lotaLfetus_Nb2HFtL9wHomo 


3.0 
3.0 



TABLE 5B: 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number




R86913 R86901 H25352 R01370 H43764 AW044451 W2129B 

AW850158 AW862385 AW8601S9 AW8623B6 AWB62341 AW821869 AWB21B93 AW062660AW062656 

AA046914AA057231 H38371 
AW179148AW179150 

AA078835 AA07931 9 AA078816 AA079026 AA122167 AA1 11933 AA0e89B9 AA0B4691 AA068999 AA06903B AA069225 AA6S0522 
BE154650 BE154785 AW468343 BE154816 BE1S4667 

AW501748 AW502972 AW502513 
AW501833 AW502145 AW502581 
AW955113AW503580 
AW812744 AW581974 AW8t2725 
AW749041 BE066026 H35202 
N95428 W24040 AW751366 H81987 
AA374186 AW963684 AA086107 AI491986 
AW936099 AW936243 AW936097 BE162104 BE162109 AW794263 
BE535988AW801777 

AW8O9575BE090626BE090617AW936SS1Am36SS2AWg36S30AW936SS0AW936481 
AW812929 AW812779 AW813088 

AW834039 AW834040 AWB34047 AW845410 BE003128 AW852479 



NS0617 N47321 R5415g AW860545AW835317 
AWB37349AW8373S5AW882717 

AW850767AW851180AWBS13S9AW8S1223AW8S1360AW851222AVV8S1108 
AW851248AW8S1425AW850305AW851021AW85090S 

BE064863 BE153698 AW856751 BE153820 BE064737 BE153674 BE064730 BE06S062 BE1S3536 AW856622 BE1SS079 BE064661 
BE153665BE0646S0BE064e91 

AA370573 BE160501 BE160600 BE160498 BE160S02 BE1604g7 N72424 AA096462 
AW876260AWe76269AW876340AW876146A™76323AW876320AW876171AWB76421AW876227AW876243 
AW936903 AW936907 AW936908 AW936914 




412732 
413137 
413161 
413247 



414193 
414337 
414372 
414390 
415296 
415618 
415920 



417178 
417762 
418913 
419401 
419896 
420352 
421418 
422046 
422128 
423403 
424093 
424719 



432075 
432474 
433371 
433403 
433767 
434738 
43619G 
436242 
436812 



1350383.1 

1351262_1 

135544_1 

1364165l1 

1374595.1 

1376621J 

1424706 -2 

1436706.1 

143909_1 

1441570.-1 

1533628.1 

1540651.1 

1551733_1 

1578993.1 

1580451.1 



180520.1 
184454.1 
1888552.1 
192979.1 
202288.1 
210744.1 
211994.1 
227942.1 

242B89J 
247B25_1 
250578.-1 
2S2945L.1 
255333.1 
263994.1 
268722.1 
269748.1 
314322J 
319643.1 



338201.1 
341066.1 
348197.1 
364430.1 
36534.-1 
374014.1 
392S62 1 
41562J 
41641.1 
427323.1 
43772.1 
445966.1 



BE142364BE142341 Ml 12025 
AW993300 N23107 R22345 
BE06691SBEC66942 

BE068130 BE06B135 BE06B134 BE068183 BE068184 BE068094 
AW963969 AWg63971 M127651 AA376726 
W47611BE087851 

BE145907 BE145796 BE145803 BE145851 BE145923 BE145812 BE145B09 BE145852 BE145856 

BE148411 BE14a415H59098 

BE260069 



AA143654 AW753140 AA213770 AW970865 AA569C7S AA492132 
BE281040 

F050B6R)5091R1715B 

F12954 H10624 R11948 RS6523T75190 

Z45684HC9361 R53285 
R14625R17962 H29120R14650 
W9044a H30749 
H70565N77403 H67949 
H69044T47567 H75691 T50292 
T83470T84283 H74054 
AW961600 AA190217 AA321260 
N51636T6ie74T51829 
AA205976AA205930 

BE046745 AI074878 AIB1 7476 AW572613 AA447586 H2B330 AA232486 AA365704 BE271 167 

AW804663 AW805C17 AA236969 
Z99362Z99353 

BE258835 AW963316 AA258918 AW843305 R14744 A1680388 BE071923 R36280 

AA806639 AA2910C8 AA836274 AW978806 

AI638562 T16929 H13401 F07773 R55836 

AW881145 AA490718 M85637 AA304575 T06067 AA331991 

AA325483 AW962159 AW962660 

AA335025 AA335496 AW966145 

H90452 AA345767 AW964302 H90399 

H16622R17322AA351959 

AA3S6842 

AA3S9119AW963014 079684 

AW963278 AA362266 AA362267 

AA374185AW9S6130H38344 

AA380828 AW963760 AA380805 AA380B30 

AW958560 AA382199 AW444933 

T85775AW963345AA468998 

AW968485 AW968670 AA480922 BE350425 

BE267477 AA491488 AW836723 

AW612343 AA922553 AA505925 AA92703B AW972537 A1693S64 

AA572994 AA516249 AA702595 

AW972934 AA526260 AA525266 AAB35021 BE000149 BE000148 

AA584C42AW973273AA548798 

T25451AA585296AA585305 

AF040247 

AA609245 AA724581 AW241989 A1377274T4730C 

AA836266 AA648266 AW974440 
AK001084AA078092AA829049 
AK002187 R56351 

AW298067 AA731645 AA810101 AW194180 A1690673AW978773 



AA772920 D59870 D61151 AI591331 
AA933999AA781181 

L09078 L03145 109094 L09098 U)316S L09102 

AF085g55H69158He9081 

AF085984 H95905 H9S9C6 



445832 
448221 
448527 



45353C 
453631 
453725 
453752 
453823 
454102 
454487 



651926.1 
75534.-1 
766707.1 
77773.-1 
804513 1 
81018.1 
823882.1 
91233.1 
918300.1 
968371.1 
97021.1 
975024.1 
978760 1 
979899.1 

1011603.1 
1216101.1 
1223775.1 
1224407.1 
1225624.1 



AI657014AW594035AI657036AI63B3gO 
AA001793AA001871 

AI681743 AW897287 AW897205 AW897284 
AA025647 R45716AW753786 
AI902519A1902518AI902516 
AL037926 AL037931 AL037957 
AW021633 AA036730 AI866854 
AL046418 N52738 R33840 
W2B543AL1 19631 



AL137967BE064150BE064186 
AW752363BE14712C N22640 
AW796342AW796356BE161430 



AWB07573 AWB07566 AWBa7572 

BE14B471 AW833743 AW833609 AW821469 AW821488 AW821541 AW821S31 AW821513 AW821549 AW821384 AW821625 AW821S77 
AWB21547AWB34577 







AWB12088AWB12105AW812082 
AWB13110AW813113 

AA091228 H71860 H71073 -- ■ 

AW819212A™19170BE158474AW819172ATO19213AVVB19200AW819256AW8192S4AW819178AW8192UAW819215AVV819233 

AW819171 

AW836922 AWB76719 AW876688 AW836919 AW836897 AW836908 AWB36912 AW836993 

AW846202 AW8461 74 AW846532 AW846181 AW846458 AW846206 AW846432 AWB46553 AW846533 AW846197 AW846198 AW846189 
AW846459 AW846530 AWB45560 AW846536 AWB46472 AW846470 AW846466 AW846192 AW846479 AW846260 AW846204 AW846139 
AW846187AW846353AWg46462AWB46151AW846549AW846538AW846527AW845567AW846531 
AW850175 AW850513 AW850412 AW850451 

AVV853057AVV853039AW853042AW853050AW853114AW853105AW8S3102AWB53111AW853121AW853109AW853126 

AW854538 AW854418 AW854412 
AW856866 AW856858 AW8568S6 
BE069037 AW936025 BE069178 AW936034 

AW936234AW936074AW936181AW936179AW936217AW936077AV«936227AW936191 

AW991925AW991919 

BE000637 BE180584 BE180M0 BE180542 BE180545 



BE090690 BE09C688 BE090681 BE090693 BE090675 
BE154173BE154Q98BE154096 

BE154837 BE154879 BE154850 BE154877 BE154835 BE154849 BE154902 BE1S4905 BE154867 BE154901 BE1S4904 BE154899 
T12198 T19584T11683 R16526 R15585 R4S876 R15562 
AI148037 AA287178 W\236756 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons



401004 
401237 
401308 
401350 
401740 
401871 
401896 
401963 
402105 
402109 
402142 
402451 



7656749 
7229982 
9801612 
9212516 
9931226 



8569194 
3126783 
8131588 
8131678 
7704985 



Nt_position

91593-91757,92720-92843,93962-94079,94824^4997 

62580-62772 
42287-42431 



171722-171859,173197-173303 
29932-30698 
48137-48343 



9255944 Minus 

9945095 Minus 

6067111 Plus 

8076842 Minus 

6862664 Minus 



404044 
404083 
404108 
404134 
404227 
404418 
404451 
404582 
404708 



405717 
405752 
405836 



8655956 Minus 
8886967 Minus 



4512287 Plus 



54241-54437 

7221-7441 

76903-76134 

157028-157145,161725-161900 
145556-145873 
136463-136646 
174449-174663 

1334-1503,2483-2585,52305337,19656-19804 

225757-225939 
16650-17082 
63603-64942 
40633-40911 
93110-93259 
153339-153481,15 
105191-105622 



77522-77658 
3995440430 

140844-140897,148510-148581 

79582-79765 

977-1116 

17802-17966,18573-18697 

10878-11048 

56211-56353 






406122 9144067 Minus 

406197 7289992 Minus 

406237 7417725 Plus 

405241 7417725 Minus 

405255 7417729 Plus 

405354 9256114 Minus 

405454 9588360 Minus 

405481 9864741 Minus 

405552 7711584 Plus 



50715-50833 
91746-91958 
91439-91579 



23275-23493,23723-23903 



Table 6A lists about 68 genes highly down-regulated in ovarian cancer compared to normal ovaries. These were selected as for Table 5A, except the "average" ovarian cancer level was set
to the maximum value amongst various ovarian cancers and the "average" normal ovary level was set to the minimum value from various non-malignant ovary
specimens, and the ratio was greater than or equal to 2.5 (i.e. 2.5-fold down-regulated in the highest tumor vs. the lowest normal ovary). This has the overall effect of increasing
stringency, and reducing the number of false-positives.
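
The stringent selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used to build the table: the function names, the plain-list input format, and the handling of a non-positive tumor level are hypothetical.

```python
def stringent_down_ratio(tumor_levels, normal_levels):
    """Return lowest normal-ovary level divided by highest tumor level."""
    highest_tumor = max(tumor_levels)    # stands in for the "average" tumor level
    lowest_normal = min(normal_levels)   # stands in for the "average" normal level
    if highest_tumor <= 0:
        # Assumption: a non-positive signal cannot serve as a denominator.
        raise ValueError("highest tumor level must be positive")
    return lowest_normal / highest_tumor


def is_highly_down_regulated(tumor_levels, normal_levels, threshold=2.5):
    """True when the normal-vs-tumor ratio meets the 2.5-fold cut-off."""
    return stringent_down_ratio(tumor_levels, normal_levels) >= threshold
```

Because the highest tumor value is compared with the lowest normal value, a single tumor sample with high expression is enough to exclude a gene, which is why this rule is more stringent than one based on typical levels.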

TABLE 6A: ABOUT 68 HIGHLY DOWN-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL OVARY 

Pkey: Primekey 

Ex.Accn: Exemplar Accession 



ratio: ratio of normal ovary vs. tumor



443131 
431075 
412637 
408141 
420122 



Ex.Accn 
AA678441 
AA804362 
AA642418 
AF220264 
AI908263 
AK000043 
AA911705 
AI033833 



U69205 

AA255714 

AW902062 



UGID 
Hs.1ig059 
HS.18C544 
Hs.17381 



Hs.291625 ESTs 



gb:Homo sapiens MOST-1 mRNA, complete cds. 



hypothetical protein FU20036 



Hs.261313 
Hs.45152 
Hs.284163 
Hs.30280 



gb:601 189542F2 NIH_MGC_7 Homo sapiens cDNA do 



421418 
450061 
409725 
434738 
431644 



419401 
402105 
444997 



AI797034 

T40760 

AA836265 

AW972822 

AW753734 

AW206285 

AI131S02 

R86913 

AA167642 



Hs.277215 
Hs.253548 
Hs.143135 

Hs.14632 
Hs.105754 
Hs.160412 



gb:QV4-UM0094-160300-13S^06 UM0094 Homo sa| 



Hs.146196 ESTs 



gb:QV0-DT002aO90200-106^05 □T0020 Homo sapie 



409090 
406606 
441202 
422046 
442865 
444431 
426294 
412480 



W55067 

AI532143 
AI638562 
N57669 



Hs.103105 
Hs.135853 
Hs.1 14541 



gb:Homo sapiens fiill length insert cDNA clone YT99F0 



459054 
421379 
454338 



AA278807 
AW798466 
Y15221 



AI672277 
AV6S9942 
AB014594 



Hs.103982 

Hs.1050 

HS.27Q425 

Hs.270720 

Hs.44024 

Hs.ig9475 

Hs.258132 

Hs.137579. 



2',5'-oligoadenylate synthetase 1
small inducible cytokine subfamily B (Cys-X-Cys), mem
gy, Sec7 and coiled/coil domains 1 (cyt






464102 AW752363 



ESTs 



301 1-f03CT0201 Homo sapien 



DNAgly 



gb:zbB0d09s1 Soares_senescent_fibroblasts_NbHSF Ho
gb:RC1-DT0029-03020CH)12-(102 DT0029 Ho 
gb:PM3 HT0344-071299^J<!08 HT0344Ho 
gb:qz30a07.x1 NCI_CGAPJ<ld11 Homosapl 
discs, large (Drosophila) homolog 2 (chapsyn-110)



TABLE 6B: 
Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



R86913 R86901 H25352 R01370 H43764 AW044451 W21298 
BE154650 BE15478S AW463343 BE154816 BE154667 
N95428 W24040 AW751366 H81987 



BE142364BE142341AA1 12025 
AW804663 AW805017 AA23696g 
AAB06639 AA29100e AAB36274 AW978806 

A1638S62T16929H13401 FO 

AA374185AW956180 H38344 



412480 
419401 
421418 



AF035984HgS905 H9S906 
A1261545 N59134 AW875371 AW87S247 
AI681743AW897287AW897205AW897284 
464102 1011603_1 AW762363 BE147120 N22640 

465388 1287904J AW936234AW936074AW936181 AW936179 AW936217AW936077AW936227AW936191 
TABLE 6C: 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA

sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons
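
As an illustration of how an Nt_position entry such as "171722-171859,173197-173303" might be read, the following Python sketch splits the field into exon intervals. The function names are hypothetical, and treating both coordinates as inclusive is an assumption rather than something the legend states.

```python
def parse_nt_position(field):
    """Split 'start-end,start-end,...' into a list of (start, end) tuples."""
    exons = []
    for part in field.split(","):
        pieces = part.strip().split("-")
        if len(pieces) != 2:
            raise ValueError(f"malformed exon range: {part!r}")
        start, end = int(pieces[0]), int(pieces[1])
        if start > end:
            raise ValueError(f"start exceeds end in: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)
```

Entries with a missing hyphen or a start beyond the end are rejected with a ValueError rather than silently guessed at.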



9212516 Plus 



Table 7A lists about 770 genes up-regulated in ovarian cancer compared to normal adult tissues. These were selected from 35403 probesets on the Affymetrix/Eos-Hu01
GeneChip array such that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 2.5. The "average" ovarian cancer level was set to
the 2nd highest amongst various ovarian cancers. The "average" normal adult tissue level was set to the 7th highest amongst various non-malignant tissues. In order to remove
gene-specific background levels of non-specific hybridization, the 15th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the
denominator.
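
The selection described for Table 7A can be sketched as follows. This is a hedged illustration, not the original analysis code: the function names are hypothetical, the nearest-rank percentile method is an assumption (the text does not say which percentile definition was used), and returning infinity for a non-positive denominator is likewise an assumption.

```python
import math


def nth_highest(values, n):
    """Return the n-th highest value (n=1 is the maximum)."""
    if len(values) < n:
        raise ValueError(f"need at least {n} values")
    return sorted(values, reverse=True)[n - 1]


def percentile_nearest_rank(values, pct):
    """Nearest-rank percentile for an integer pct in 1..100 (assumed method)."""
    ordered = sorted(values)
    rank = max(1, (pct * len(ordered) + 99) // 100)  # integer ceiling
    return ordered[rank - 1]


def up_regulation_ratio(tumor_levels, normal_levels):
    """Background-subtracted ratio of tumor to normal adult tissue levels."""
    background = percentile_nearest_rank(normal_levels, 15)
    numerator = nth_highest(tumor_levels, 2) - background
    denominator = nth_highest(normal_levels, 7) - background
    if denominator <= 0:
        # Assumption: treat a vanishing normal signal as unbounded up-regulation.
        return math.inf if numerator > 0 else 0.0
    return numerator / denominator


def is_up_regulated(tumor_levels, normal_levels, threshold=2.5):
    return up_regulation_ratio(tumor_levels, normal_levels) >= threshold
```

Taking the 2nd highest tumor value and the 7th highest normal value, instead of the extremes, keeps a single outlier sample from deciding whether a gene is selected.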



TABLE 7A: ABOUT 770 UP-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES

Pkey: Primekey 

Ex.Accn: Exemplar Accession 

UGID: UniGene ID

Title: UniGene Title 

ratio: ratio of tumor vs. normal tissues



F09255" 
W70242 



101804 
123005 
114965 



119780 
104301 
132632 



AA479726 
AA250737 
AA347193 



UGID 

Hs.4993 

Hs.68086 

Hs,50758 

Hs.1145 

Hs.3132 

Hs,2142 

Hs.53631 

Hs.30743 

Hs,1904 

Hs.256579 

Hs.9700 

Hs.169840 

Hs.105577 

Hs.72472 

Hs.62180 



chromosome-associated polypeptide C



preferentially expressed antigen in i 



Hs.158688 KIAA0741 gene product 
Hs.98502 ESTs 
Hs.184942 
Hs.191381 
Hs.6783 
Hs.5398 






105293 AA233459 



Hs.26369 
Hs.62180 
Hs.151586 



109166 
121779 
102915 
105317 
125250 



AA043562 
M448297 
AA436666 
M179845 
M422036 



Hs.237825 
Hs.59761 
Hs.73625 
Hs.98367 



signal recognition particle 72kD
ESTs 

RAB6 interacting; kinesin-like (rabkinesin6)



AA233926 Hs.23635 



ESTs; Weakly similar to D20922 [C.elegans]



Hs.101515 
Hs.1 69252 
Hs.21943 



AA434329 Hs.365S3 



106474 
123308 
106124 
111345 
105200 
116416 
118846 

120472 
115291 
111185 
108778 
132939 
134620 
114724 
116296 
102136 
132725 

116401 
127663 
104252 
120438 
131S7S 
134621 
120671 
102627 
100661 
116204 
131386 
129097 
131226 



125370 
130010 
131946 
116238 
102221 



101255 
106895 
104943 
129229 
102305 
106553 
112305 
123972 
102676 



AA423987 

N89820 

AA195399 



Hs^7B6 
Hs.104472 
Hs.122579 
Hs.12844 
Hs.90847 
Hs.61152 
Hs.257325 



U15552 
L41887 
F04600 
AAS99963 
AI367707 
AF002246 
AA243441 
□80008 
L02547 
AA280738 
U66551 
HG2874-HT3018 
N59359 Hs.48443 
AA096412 HS.17313E 



Hs.210863 
Hs.99488 
Hs.36232 
Hs.172865 
Hs.128679 



NS2966 

M87339 

AA479362 

U24575 

R00841 

U81523 

M14113 

M610086 

U26727 



AA454967 

R54822 

C14782 

U72514 

AA449741 

M025104 

AA399371 

N76398 



ESTs; Weakly similar to Gag-Pol polyprotein [



ESTs; Highly similar to SPERM SURFACE PROTEIN

Homo sapiens mRNA; cDNA DK

acidic 82 kDa protein mRNA



Hs.24485 
Hs.25324 
Hs.172608 
Hs.151791 
Hs.142838 
Hs,3S12C 
Hs,47144 

Hs.18925 
Hs.25195 
Hs.79345 



Hs.31511 

Hs.237146 

Hs.1 49894 

Hs.25245 

Hs.169674 

Hs.109643 

Hs.90073 

Hs.5887 

Hs.26244 

Hs.70337 

Hs.12045 

Hs.4029 

Hs.61252 



to L1CAM 

ESTs; Weakly similar to ORF YKR074w [S.cerevi

KIAA0186 gene product 

cleavage stimulation factor; 3' pre-RNA; subu



dual-specificity tyrosine-(Y)-phosphorylation
HKR-T1=Kruppel-like zinc finger protein [huma
chondroitin sulfate proteoglycan 6 (bamacan)



KIAA0092 gene product 



ESTs; Weakly similar to cDNA EST yk339a7.5 co
endometrial bleeding associated factor (left-
c; procoagulant compon



cyclin-dependent kinase inhibit



polyadenylate binding protein-interacting pro
chromosome segregation 1 (yeast homolog)-like
ESTs; Highly similar to RNA binding motif pro



ESTs; Weakly similar to zinc finger protein S 









131646 


M17189S 


HSJ0057 


Homo sapiens clone 24749 and 24750 mRNA seque 


5.2 


121770 
122512 


AA421714 
AA449311 


Hs.11469 
HS.9B653 


KIAA0896 protein 

budding uninhibited by benzimidazoles 1 (yeas


5.2 


105870 


AA399623 


Hs.23505 




&1 


100341 


D63506 


HS.BB13 


syntaxin binding protein 3


5.1 


116848 


H65187 


HS39001 


ESTs 


5.1 


120821 


AA347419 


HS.96B70 


Homo sapiens mRNA full length insert cDNA clo


5.1 


130690 


AA084286 


Hs.139033 


paternally expressed gene 3 


5.1 


122661 


AA454936 


Hs^45541 


ESTs 




123169 


AA4a3892 


H3.104472 


ESTs; Weakly similar to Gag-Pol polyprotein [


5.1 


108810 


AA130596 


Hs.71331 


ESTs; Weakly similar to POTENT HEAT-STABLE PR


5.0 


110799 


N26101 


Hs.7838 


Human ring zinc-finger protein (ZNF127-Xp) ge


5.0 


120619 


AA234372 


Hs.111471 


ESTs 


5.0 


122792 


AA460225 


Hs.99519 


ESTs 


5.0 


129912 


AA047344 


Hs.107213 


ESTs; Highly similar to NY-REN-6 antigen [H.s


5.0 


102823 


U90914 


Hs.5057 


carboxypeptidase D 


4.9 


129890 


M13699 


Hs.1 11461 


ceruloplasmin (ferroxidase)


4.9 


101084 


L05425 




Homo sapiens autoantigen mRNA; complete cds 


4.9 


134859 
115955 


D87716 


Hs.90315 
Hs.44198 


KIAA0007 protein 

Homo sapiens BAC clone RG054D04 from 7q31 


4.9 


106616 


AA257971 


Hs.21214 


ESTs 




114932 


AA242751 


Hs.16218 




4.9 


106672 


AA451300 


Hs.30643 


ESTs 




106126 


AA424006 


Hs.22972 


ESTs; Moderately similar to H6AR [M.musculus]


4.8 


110695 


H93463 


Hs.124777 


ESTs 


4.8 


102025 


U03911 


Hs.78934 


mutS (E. coli) homolog 2 (colon cancer; nonpo


4.8 


133282 


U52960 


Hs^OBSS 


SRB7 (suppressor of RNA polymerase B; yeast) 


4.B 


119708 


W67810 


Hs.57904 


mago-nashi (Drosophila) homolog; proliferatio


4.7 


120695 


AA291468 




ESTs 


4.7 


128651 


AA446990 


Hs.103135 


ESTs 


4.7 


103152 


X66533 


Hs.77890 


guanylate cyclase 1; soluble; beta 3


4.7 


108699 


AA121514 


Hs.70832 


ESTs 


4.7 


115094 


AA255921 


HS.S809S 


ESTs 


4.7 


121429 


AA406293 


Hs.193498 


ESTs 


4.7 


123203 


AA4a9S71 




glutamate-cysteine ligase (gamma-glutamylcyst


4.7 


126802 


AA947601 


Hs.97056 


ESTs 


4.7 


130527 


C17334 


Hs.184227 


F-box protein 21 


4.7 


134470 


X54942 


Hs.83758 


CDC28 protein kinase 2


4.7 


100449 


D87470 


Hs.75400 


KIAA0280 protein




110970 


N51374 


Hs.96870 


Homo sapiens mRNA full length insert cDNA clo


4.7 


115901 


AA436403 


Hs.86909 


ESTs; Moderately similar to Frizzled-6 [H.sap


4.7 


109799 


F10770 


Hs.180378 


Homo sapiens clone 669 unknown mRNA; complete


4.6 


116195 


AA465148 


Hs.72402 


ESTs 


4.6 


132122 


U65092 


Hs.40403 


Cbp/p300-interacting transactivator; with Glu




108990 


AA152296 


Hs.72045 


ESTs 




109055 


AA160529 


Hs.48524 


ESTs 


4.6 


115937 


AA443269 


Hs.30991 


KIAA0957 protein


46 


133520 


X74331 


Hs.r4519 


primase; polypeptide 2A (58kD)


4.6 


131200 


AA609427 


Hs^10706 


ESTs; Moderately similar to llll ALU SUBFAMIL 


4.6 


121369 


AA405657 


Hs.1 28791 


Human DNA sequence from clone g67N21 on chrom


4.5 


132880 


AA444369 


Hs.177537 


ESTs 


4,5 


127386 


AI457411 


Hs.106728 


ESTs 


4.6 


120067 


W93592 


Hs.47343 


ESTs 


45 


122986 


AA479063 


Hs.102947 


ESTs 


4,5 


135286 


AA401269 


Hs.97849 


ESTs 


4.6 


130155 


L33404 


Hs.151254 


kallikrein 7 (chymotryptic; stratum corneum)


45 


106103 


AA421104 


Hs,12094 


ESTs 


4.5 


102654 


U68494 


Hs,24385 


Human hbc647 mRNA sequence 




107876 


AA025315 


Hs,61184 


Novel human gene mapping to chromosome X


4.4 


109464 


M232265 




ESTs 


44 


125960 


D63307 


Hs!l45968 


ESTs 




126892 


AI160190 


Hs.76127 


hect (homologous to the E6-AP (UBE3A) carboxy




100269 


D38560 


H3.1189 


E2Flransciipllon factors 




134161 




Hs,79440 


IGF-II mRNA-binding protein 3




100502 


HG1496-HT1496 


Adrenal-Specific Protein Pg2 




105542 


AA261858 


Hs.8241 


ESTs; Weakly similar to heat shock protein hs 




109787 


F10610 


Hs.34853 


inhibitor of DNA binding 4; dominant negative 




110759 


N21671 




ESTs 




129970 


M478975 


Hs!200434 


ESTs 


4.3 


134666 
117693 


AA482319 
N40939 


Hs,8r52 
Hs.44162 


putative type II membrane protein

ESTs; Weakly similar to cDNA EST yk342h12.5 c


4.3 


111008 


N55388 


Hs.7222 


ESTs 






AA398155 


Hs.97600 


ESTs 


42 


105803 


AA393808 


Hs.21490 


KIAA0438 gene product


4.2 


121331 


AA405747 


Hs.97865 


ESTs; Weakly similar to i/VASP-family protein [ 




100893 


HG4557-HT4962 




i.2 


107175 


AA621762 


Hs.7576 


ESTs 


42 


118976 


N93629 


Hs.93391 


ESTs 


4.2 


130703 


N6329S 


Hs.13103 


ESTs 


4.2 


106S40 


AA4S4607 


Hs.38114 


ESTs; Weakly similar to coded for by C elega 


4.2 


119367 


T78324 


Hs.90905 




4.2 


133633 


D21262 


Hs.75337 


nucleolar phosphoprotein p130


4.2 


105520 


AA258068 


Hs.33085 


WD repeat domain 3 


4.2 


114264 


Z40074 


HSJ7595 


ESTs
X02530

AA210695 

X63187 

R37700 

H5%17 

NS4915 

F03517 

R59312 

AA291716 

AA432074 

R43300 



116134 
107638 
131941 
106154 



Hs.17212 
Hs.2719 
Hs.208261 
Hs.5199 
Hs.82719 
Hs.90787 
Hs.197642 
"Hs.97258 
Hs.32538 
Hs.22929 
Hs.30696 
Hs.50441 
Hs.42743 
Hs.35086 



IP10; 'smaB inducible cytokine subfamily B ( 
ESTs 

epididymis-specific; whey-acidic protein type



ESTs; Weakly similar to DNA-DIRECTED RNA POLY



ESTs; Highly similar to CGI-04 protein [H.sap
ESTs; Weakly similar to predicted using Genef 
ubiquitin-specific protease 1 



AA424ai4 H3.187509 



ESTs; Weakly similar to ORF YGLOSOw (S.cerevi 






AA436102 

J05032 

U91327 

U14518 

U90651 

W86748 

AA621108 

Z39338 

AA458934 

AA490862 

AA397822 

R63542 

AA172215 

U42360 

N34096 

D83004 

AA135468 

AA401475 

S82597 

U71207 

AA292677 

U40434 

AA461063 

AA463463 

M430124 

AA621340 

M452141 

J04177 

AA431448 

H02554 

N91948 

L0751S 

T97307 

N91246 

N73808 

Ml 67268 

W70205 

AA1 16036 



114253 Z39909 
103169 X68560 
111269 N70711 
T03488 



102123 
102813 
113970 
107145 
114212 
106614 
132742 
120948 

103835 
133330 
133928 
133640 



116157 
106691 
115844 
107159 
106498 
134405 
106260 
109864 
124648 



Hs.21015 

Hs.256559 

Hs.80758 

H3.6466 

Hs.1594 

H5.151461 

Hs.8109 

Hs.173001 

Hs.21201 

Hs.179912 

Hs .55901 

Hs.104650 

Hs.1 10488 

Hs.93748 



H3.7111I 
Hs.7766 
Hs.75356 
Hs.71573 
Hs.39733 
Hs.80120 
Hs.29279 
Hs.80624 



lls.10600 

Hs.7171 

Hs.82772 

Hs.5250 

Hs.30323 

Hs.125034 

Hs.89232 

Hs.161720 

Hs.102897 

Hs.24936 

Hs.62349 

Hs.43670 



112876 
118251 
130335 



100418 
101484 
102547 



134098 
101188 
132149 



AA126474 
C20976 
M15341 
AA504631 



U57911 

Z29083 

AA478601 

AA328993 

AA412720 



aspartyl-tRNA synthetase 
chaperonin containing TCP1; subunit 2 (beta 
centromere protein A (17kD)
embryonic ectoderm development protein



DKFZP566B0846 protein
ESTs 

ESTs; Weakly similar to C43H8.1 [C.elegans]

ESTs; Highly similar to similar to mago nashi
KIAA0990 protein

ESTs; Moderately similar to TRANSCRIPTION FAC
Putative prostate cancer tumor suppressor
ubiquitin-conjugating enzyme E2E 1 (homologou
ubiquitin-conjugating enzyme E2N (homologous



protein CRIPT
UDP-N-acetyl-alpha-D-galactosamine:polypeptid
eyes absent (Drosophila) homolog 2
ESTs 

mesothelin 

ESTs; Highly similar to HSPC011 [H.sapiens]
ESTs; Weakly similar to ACTIN; CYTOPLASMIC 2 
ESTs 

ESTs; Weakly similar to ORF YKRO8I0 |S.cerevi 



ESTs 

Human ras inhibitor mRNA; 3' end 
kinesin family member 3A 
chromosome 20 open reading frame 1 
kallikrein 8 (neuropsin/ovasin)
replication protein A3 (14kD)



Hs.82128 
Hsi5B737 
HS.104S58 



ESTs; Highly similar to ribosomal protein 1^ 
small nuclear ribonucleoprotein polypeptide B
ESTs; Weakly similar to hypothetical 43.2 kDa
Guanosine 5'-Monophosphate Synthase
KIAA0225 protein
interferon-induced protein 56
chromosome 11 open reading frame 8
5T4 oncofetal trophoblast glycoprotein



ESTs; Highly similar to CGI-118 pn
AA465358 Hs.118793 ESTs; Highly sin



109139 
111870 
113848 
127947 
123056 
129914 
132148 
134644 
115047 
102398 
127479 
105545 
101433 



AA428300 
AA23436E 
LC4510 
AA121289 
AA176121 
R37778 

A1432475 
A1379480 
U22377 
AA283988 






M24486 
AA291268 
AA464103 

Z47727 

M251330 

AA412488 

AA258421 

U66615 



Hs.179729 

Hs.5152 

Hs.76768 



110787 
131621 
132813 
116370 
131965 W90146 
115221 AA262942 
116093 AA456020 
123507 AA600176 
129801 F11087 
115084 AA255566 



AA598B03 
AA253217 
D13645 
AA2SS030 
AA281950 
D83767 
HG4053-HT4 
U31814 
U36602 
N79820 
U44754 
AA435840 
AA425354 
AA451676 
M025234 
AA056538 

W37721 

AA127459 

D87120 

N34893 

D63875 



115061 
100146 
115140 



102287 
102788 
118836 
102423 
106300 
106156 
106483 



129121 
131638 
132962 
133767 



134372 D63877 

130938 AA0132SO 

116169 AA258427 

123978 C20553 

108807 AA129953 

132581 R42255 

134654 W23625 

105730 AA292701 

111295 N73275 

102009 U02580 



AA421171 
AA463745 
S77154 



Hs.35962 
Hs.79741 
Hs.50848 
Hs.112345 
Hs.239666 
Hs.42484 
Hs.111496 
Hs.41271 
Hs.2471 



Hs.4210 

Hs.30299 

Hs.61260 

Hs.27842 

Hs,43756 

Hs.151353 

Hs.108788 

Hs^9832 

Hs.6153 

Hs.17328a 

Hsi4844 

Hs£2324 

Hs^139B 

Hs.68427 

Hs.170278 

Hs,49376 

Hs.52256 

Hs.8739 

Hs,5364 

Hs.21275 

Hs.82643 

Hs.22385 

Hs.1657 



1s,29403 
1s,82120 
ls.79404 
ls.65874 



ESTs 

survival of motor neuron protein interacting 

ADP-ribosylation factor domain protein 1; 64k

ESTs; Highly similar to retinoic acid-induced

zinc finger protein 281 

ESTs; Weakly similar to hypothetical protein

DKFZP564J0863 protein 

ESTs 



SRY (sex-determining region Y)-box 5
tiomeotraxBS 

Human N33 protein form 1 (N33) gene, exon 1 a
collagen; type X; alpha 1 (Schmid metaphyseal
ESTs; Weakly similar to katanin p80 subunit [
procollagen-proline; 2-oxoglutarate 4-dioxyge 
DKFZP586L0724 protein 
ESTs 



ESTs 
ESTs 
hypothetical protein 

SWI/SNF related; matrix associated; actin dep

ESTs; Weakly similar to C44B9.1 [C.elegans]

ribonuclease P (30kD)

solute carrier family 11 (proton-coupled diva

ESTs; Moderately similar to NUCLEAR PORE COMP



ESTs; Weakly similar to KIAA0862 protein [H.s



Homo sapiens mRNA; cDNA DKFZp564C053 (from cl



104903 
109704 



"-llmRNA-binding protein 2 
Ts 

Ts; Weakly similar to similar to 1-acyl-flly 



ESTs; Moderately similar to PUTATIVE GLUCOSAM 

ESTs 

ESTs 

ESTs; Weakly similar to PROTEIN PHOSPHATASE P 

ESTs; Weakly similar to beta-TrCP protein E3R

ESTs; Weakly similar to ORF YGR200c [S.cerevi

DKFZP564I052 protein

ESTs; Weakly similar to ubiquitin-conjugating

protein tyrosine kinase 9

ESTs; Weakly similar to KIAA0970 protein [H.s

estrogen receptor 1 



ESTs 



AA478106 Hs.12692 

F09637 Hs.12876 

N64267 Hs.10177 

AA251906 Hs.48473 

AA450312 Hs.23748a 



iRNA; cDNA DKFZp5B6C1723 (from c 
; Weakly similar to protein phosphatase-1 



Homo sapiens mRNA; cDNA DKFZp434E1 02 (from cl 

103753
131882 
123673 
132936 
103023 
120572 
132384 






106753 
102621 
103330 



115860 
115278 
134125 
129160 
121710 
1C2242 
104956 
113047 
116017 
133780 
129463 
130353 
106036 
102234 
106133 



AA280794 

AM79933 

AA282914 

AA147719 

N71781 

R44538 

W38419 

AA041SS1 



Hs.90421 

Hs.23625 

Hs.180320 

Hs.3385 

Hs.112712 

Hs.6111 

Hs.117950 

Hs.258787 

Hs.46967 

Hs.10176 

Hs.159441 

Hs.50081 

HS.1408B9 

Hs.24935 

Hs.48644 

Hs.73826 



loBBALUSUBFAMIL 



104733 
103879 
103038 
135164 
11486C 
102437 
100352 
103631 
104238 
108613 
115916 
120640 
124068 
130375 
131632 
131523 



AA476944 

U66075 

X85373 

M481403 

L15309 

AA393755 

T10196 

AA213459 

AA426017 

M434257 

H99131 

X76040 

AA431719 

AA279757 

R3B102 

AA131252 

AA419011 

U27185 

T25867 

AA251880 

M14219 

AA421213 

X86018 

AA412505 

U26312 

AM24346 

H47357 

AA465194 

AA610108 

N90a29 

AA019493 

AA22B148 

XS4941 

AA126433 

AA235112 

U46S69 

□64159 

Z48570 

AB002364 

AA1009B7 



AA427890 
AA131584 
R49483 



131538 
105616 
101228 
100280 
132744 
103106 
106984 
105127 
102302 
117708 
111314 



X61970 
AA521201 
AA158132 



ce from clone 167A19 on chtom 



Hs.77495 
Hs.107213 
Hs.193677 
Hs.117211



Hs^l 

Hs.109007 

Hs.96744 

Hs.32943 

Hs.120975 

Hs.7549 

Hs.179982

Hs.76152

Hs.111632

H3.172210 

Hs.10653 

Hs.8123 

Hs.107573 



HS.77S50 
Hs.173242 
Hs.106227 
Hs.221986 



Hs.163933 
Hs.101619 
Hs.155172 



Homo sapiens mRNA full length ins



Hs.lQ 

Hs.44581 ESTs; Weakly similar to HEAT SHOCK 70 KD PROT 



ESTs 

GATA-binding protein 6 
small nuclear ribonucleoprotein polypeptide G 
ESTs; Highly similar to NY-REN-6 antigen [H.s
zinc finger protein 141 (clone pHZ-44)
ESTs; Highly similar to CGI-62 protein [H.sap
ESTs; Weakly similar to /prediction
transcription factor 17

ESTs; Highly similar to DNA-REPAIR PROTEIN CO
ESTs; Moderately similar to !!!! ALU SUBFAMIL
ESTs 

protease; serine; 15 



ESTs; Weakly similar to hypothetical protein 



chromobox homolog 3 (Drosophila H
sialyltransferase

ESTs; Moderately similar to weak sim



ESTs; Moderately similar to similar to mi 



adaptor-related protein complex 3; beta 1 sub 



Hs.71436 

Hs.22159 

Hs.170008 

Hs.28606 

Hs.24968 

Hs.82916 

Hs.155314 

Hs.55921 

Hs.76913 

Hs.7129 

Hs.11817 

Hs.69171 

Hs.46476 

Hs.33922 

Hs.168147 

HS.15S017 



ESTs; Weakly similar to PHOSPHOLIPID HYDROPER
DKFZP564O0463 protein

ESTs; Weakly similar to ZINC FINGER PROTEIN H



subunit6A(zela 
KIAA0095 gene product 
glutamyl-prolyl-tRNA synthetase
proteasome (prosome; macropain) subunit; alph 



H.sapiens novel gene from PAC 117P20; chromo
ESTs 

nuclear receptor interacting protein 1 

128420

108746 
127236 
114208 
107071 
104967 
124073 
130869 






A1088155 
M126974 
AI341818 



AA074919 

H05394 

AA128100 



116545 
117873 
121463 

131185 
134380 
105740 
130919 
134423 
104896 
134407 
106373 
112283 
109018 
114239 
114959 



107243 
1344B9 
134054 
127370 
113277 
132783 
109010 
130095 



111353 
105344 
134498 
117910 
118903 
121713 



D86425 

L49054 

AA127514 

N89829 

AA261089 

M449338 

D20313 

N49967 



Hs.14146 
Hs.43388 
H5.986S8 
Hs.7859 

Hs.35198 
Hs.10026 
Hs.127376 

Hs'.242894 
Hs.85222 
Hs.247277 
Hs.101850 
Hs.82733 

Hs.61603 



AA291710 

W96151 

AAa54228 

X72964 

AA445994 

R53545 

Ml 56960 

Z39742 

M250775 

AA608752 

AA279803 

AA393827 

W44923 

D59894 

U09284 



T65797 

N74897 

AA156460 

F01831 

AM59249 



Hs.102652 

Hs.23%0 

Hs.179565 

Hs.10852 

Hs.21276 

HSJ3006 

Hs.23165 

Hs.32794 

Hs.21331 

Hs.20952 

Hs.1 14992 

Hs.222478 

Hs.87747 

Hs.71969 

Hs.32204 

Hs.20104 

Hs.4878 

Hs.34782 

Hs.112378 

Hs.78393 

Hs.70337 

Hs.11774 



Hs.8715 

Hs.181309 
Hs.6616 
Hs3645 
Hs.84131 

Hs.12940 



ESTs; Weakly similar to unknown [H.sapiens]



ESTs 
ESTs 

ESTs; Weakly similar to ORF YJL063C [S.cerevi

KIAA0266 gene product

uridine monophosphate synthetase (orotate pho

ADP-ribosylation factor-like 1

ESTs; Weakly similar to fy7090_2 [H.sapiens]

ESTs; Weakly similar to transformation-relate 

retinol-binding protein 1; cellular 

nidogen 2 

Homo sapiens t(3;5)(q25.1;p34) fusion gene NP

ESTs 

ESTs 



minichromosome maintenance deficient (S. cere
ESTs

collagen; type IV; alpha 3 (Goodpasture antig
ESTs; Highly similar to CGI-139 protein [H.sa
ESTs

caltractin (20kD calcium-binding protein)

ESTs 

Homo sapiens clone 24411 mRNA sequence 



Homo sapiens mRNA; cDNA DKFZp564P0823 (from c



LIM and senescent cell antigen-like domains 1 

immunoglobulin superfamily; member 4 
protein (peptidyl-prolyl cis/trans isomerase) 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide
dual specificity phosphatase 12 



ESTs; Weakly sir , . 

H.sapiens mRNA for Ptg-12 protein
proteasome (prosome; macropain) subunit; alph






133415 
120649 
131267 
134480 



AA287115 

AA258042 

AA024564 

F13789 

AA126719 

AA251010 



AA521265 
AA411462 
AA232183 
AA010530 
AA1 15629 
AA4S2112 



132366 

119636 
124637 
130588 



HS.2C7776 
Hs.164g78 
Hs.73149 
Hs.99697 
Hs.24908 
Hs.839ie 
Hs.93796 
Hs.25282 
Hs,87807 
Hs,18439 



Hs.11042 

Hs.173042 

Hs.9599 

tte,1 18531 

Hs.42644 

Hs.21187 

Hs.46541 

Hs.56148 

Hs.75798 

Hs.16411 

Hs.7525 

Hs.3281 

Hs.155478 

Hs.251631 

Hs.86919 



NADH dehydrogenase (ul 
DKFZP586D2223 protein 



ESTs; Weakly similar to veli 1 [H.sapiens]
ESTs; Weakly similar to !!!! ALU SUBFAMILY J
Human BAC clone GS025M02 from 7q21-q22



ESTs 

Homo sapiens PAC clone DJ0894A10 from 7q32-q3
ESTs; Moderately similar to NY-REN^ antigen
Human DNA sequence from clone 1183121 on chro
Human DNA sequence from clone 1189B24 on chro
ly similar to KIAA0742 protein fis



proteasome,, , ^ 

ESTs; Weakly similar to !!!! ALU SUBFAMILY J
100463 D89289
132920 106133 
113490 T88700 



128770 
129122 
104827 
111348 



106094 
126433 
107129 
104491 
105043 



120455 
130861 
131649 
128517 
100486 
116729 
101851 
115465 






N62515 

AA035630 

N90041 

R45698 

U15932 

M236359 

AA419461 

N93125 

M520553 

N71513 

AA132239 

AA486182 

F09530 

M251083 



U90551 

AA306121 

[B1544 



129426 
103437 
129821 
130160 
104257 



107056 
108536 
116226 
120266 
128654 
116726 
132640 



105621 
129164 
133618 



125134 
129254 
102339 



AMI 2087 
X98250 
F11019 
Z39228 
AF006265 
AA465701 
AA262925 
AA207114 
AA435551 
AA459296 
AA600310 
AA084605 
AA478729 
AA173939 
H20689 
F13681 
U33821 
AA147725 
AA132983 
AA280865 
AA282183 
U7e624 
AA258785 
116429 AA609710 
110984 N52006 
100372 D79997 
W19228 
AA453624 
U37022 
AA456646 
R44122 
AA406320 
AA436568 
AA365527 
AA096178 
AA143190 
AA205649 
AA432136 
120321 



Hs.118722

Hs.806

Hs.173374

Hs.73818 

Hs.143460 

Hs.108790 



Hs.21893 
Hs.2128 
Hs.39504 
Hs.18127
Hs.137300
Hs.4756
Hs.39328
Hs.11810
Hs.17975
Hs.180591
Hs.104347



Hs.82045 
Hs.43691 
Hs.15071 
Hs.146401 
Hs.28777 
Hs.111515 
Hs.10g701 
HS.9158S 
H5.121017 
Hs.168272 
Hs.82254 



129108 
129385 
131412 



114471 
102051 
106916 



U34044 

AA393164 

AA481146 

AA488872 

AA101416 

AA482014 

M212S9 

N36881 

AA023074 

U07550 

AA490814 



ESTs; Weakly similar to CD4.2 [C.elegans]
ESTs; Weakly similar to ROOFasb [C.elegans]



ESTs; Weakly similar to p60 katanin [H.sapien

(^-Like Protein Tc4

ribonuclease P; 40kD subunit

midkine (neurite growth-promoting factor 2)

ESTs

chaperonin containing TCP1; subunit 8 (theta)



H2A histone family; member L
ESTs; Weakly similar to predicted using Genef
ESTs; Moderately similar to wealcslnillanvt
protein phosphatase 1; regulatory (inhibitor)
H2A histone family; member A

EST; Highly similar tor" 

M-piiase ptv 



Hs.180034
Hs.27842
Hs.30824
Hs.105039
Hs.18720
Hs.226440 
Hs.76450 



Hs.69469 

Hs.44155 

Hs.6375 

hls.10904S 

Hs.75251 

Hs.107476 

Hs.82837 

Hs.80120 

Hs.184339

Hs.100748

Hs.1098

Hs.95577 



Hs.172140 
Hs.177861
Hs.70337
Hs.12577
Hs.85371 
Hs.98582 
Hs.1087 

Hs!l24027 
Hs.97644 
Hs.41086 
Hs.12314 
Hs.107149 
Hsa463 
Hs.1066 
Hs.82202 



ESTs; Weakly similar to !!!! ALU SUBFAMILY J 
programmed cell death 8 (apoptosis-inducing f
Homo sapiens clone 24881 mr- 



lar to inner centromere piot 



Tax1 (human T-cell leukemia virus type I) bin
dendritic cell protein
DKFZP586G1517 protein

■ \; cDNA DKFZp564K0222 (fri 



DEAD/H (Asp-Glu-Ala-Asp/His) box binding prot
ATP synthase; H+ transporting; mitochondrial
Human chromosome 3p21.1 gene sequence
UDP-N-acetyl-alpha-D-galactosamine:polypeptid
KIAA0175 gene product
ESTs

deoxynucleotidyltransferase; terminal



ESTs; Highly similar to CGI-110 protein [H.sa
immunoglobulin superfamily; member 4
ESTs; Highly similar to CGI-147 protein [H.sa
zinc finger protein 254
ESTs

serine/threonine kinase 2

Homo sapiens clone 26007 mRNA sequence

SELENOPHOSPHATE SYNTHET



to OXYSTEROL-BINDING PRO
Homo sapiens mRNA; cDNA DKFZp586C1019 (from c
ESTs; Weakly similar to PTB-ASSOCIATED SPLICI
centrin; EF-hand protein; 3 (CDC31 yeast homo
small nuclear ribonucleoprotein polypeptide E



ESTs 



sinL17
116142
109912 
103193 
102046 
104667 
112996 
118138 






AA460549 

H05509 

X70476 

U07151 

R64534 

T23539 

AA485724 

H94692 

N48000 

H77546 

Y12065 

AA075427 

AA461551 

AA093287 

AA092376 



114366 
114162 
133370 
106818 
133501 
100530 
130553 



AA460917 

U21858 

Z41688 



HG1869.H1 

AA430032 

AA137078 

AA436679 

W45531 

W84865 

AA46S016 

AA04S365 

AA460264 



Hs.24639 

Hs.75724 

Hs.182215 

Hs.101469 

Hs.7165 

Hs.g3S60 

Hs.ig2119 

Hs.6906 

Hs.140945 

Hs.57898 

Hs.5092 

Hs.17296 

Hs,16251 

Hs.76728 

Hs.90606 

Hs,106778 

Hs.34892 

Hs.94896 

Hs.2780 

Hs,60679 

Hs.18663 

HSJ2157 
Hs.3542 
Hs.74284 



coatomer protein complex; subunit beta 2 (bet



131130 
112043 
116146 
122378 
103134 
133395 
115652 
104976 
134691 
112869 
100092 
102635 
104490 
106813 
106872 

107113 
113281 
115586 
115779 
122895 
124726 
129775 
131991 
132518
134612 
130313 
131971 
133175 

125670 
121322 
106719 



105387 
103073 
116294 



131076 
114096 
120402 
102125 
134653 
101959 
116766 



AA4607C1 
AA44ei00 
X65724 
M491296 
AA405098 
M086071 
M59979 
T03313 
AF000231 
U66838 
N71503 
M479922 
M487907 
AA599041 
AA610073 
T66300 
AA399218 
AA4241B3 
M469946 
R15740 
R94659 
AA25190g 

AA451712 



Hs.94642 

Hs.40094 

Hs.69423 

HS.51BB 

Hs.155983 

Hs.6456 

Hs.23255 

Hs.26312 

Hs.193200 

Hs.103617 

Hs.2839 

Hs.72805 

Hs.38178 

Hs.60758 

Hs.88474 

Hs.4747 

Hs.75618 

Hs.79378 

Hs.43087 

Hs.181022 

Hs.18282 



M425107 

M465171 

AA236412 

H97781 

AA236951 

X59417 

AA489000 

D59269 

H95094 

U52077 

H44386 

Z38342 



ESTs 

v-ral simian leukemia viral oncogene homolog 

Homo sapiens mRNA; cDNA DKFZp586L141 (from

ESTs; Highly similar to NY-REN-49 antigen [H

nucleolar protein (KKE/D repeat)

ESTs; Weakly similar to /prediction

ESTs; Highly similar to 73 kDA subunit of cle



TATA box binding protein (TBP)-associated fac

ESTs 

ESTs 

DKFZP564I1922 protein 
ESTs 

ESTs; Moderately similar to Similar to S.cere 
Male Enhanced Antigen
pituitary tumor-transforming 1
ESTs

ESTs; Highly similar to CGI-07 protein [H.sap
ESTs

Human DNA sequence from clone 167A19 on chrom
kallikrein 10

ESTs; Weakly similar to 60S RIBOSOMAL PROTEIN

KIAA0677 gene product

chaperonin containing TCP1; subunit 2 (beta)

nucleoporin 155kD

glioma amplified on chromosome 1 protein (leu

ESTs 

ESTs 



Hs.70945 
HS.10S32S 
Hs.104576 
Hs.12420 
Hs.36708 
Hs,5064 
Hs.171581 
Hs.154320 
R70167 Hs.3611 
AA134767 Hs.66666 



chromosome-associated polypeptide C 
prostaglandin-endoperoxide synthase 1 (prosta
dyskeratosis congenita 1; dyskerin
RAB11A; member RAS oncogene family

cyclin A1

ESTs; Weakly similar to dysferlin [H.sapiens]
ESTs

ESTs; Highly similar to unknown [H.sapiens]
programmed cell death 10
ESTs; Weakly similar to oligophrenin-1 like p
Homo sapiens mRNA for lipoyltransferase; comp 



budding uninhibited by benzimidazoles 1 (yeas 
ESTs 

ESTs; Highly similar to ubiquitin C-terminal



Hs.14415 
Hs.108636
Hs.74077
Hs.93743
Hs.127842
HS.751B7 
Hs.247948 
Hs.22665 
Hs.27007 
Hs.50282 
Hs.107573 
Hs.87385 
Hs.180832 
HS.95C97 



proteasome (prosome; macropain) subunit; alph
ESTs; Moderately similar to TRANSCRIPTION FAC
Homo sapiens mRNA full length insert cDNA clo
translocase of outer mitochondrial membrane 2



GTP-binding protein ragB 

sialyltransferase 

ESTs 

arginyl-tRNA synthetase
104954
108771 
116439 
133859 
132792 



AA074514 

AA127924

AA610068 

U85782 

AA401903 

AA010686 

M191353 

M401136 

X17620 



ESTs; Weakly sir 



to protein [H^apiens] 



PIBF1 gene product 

26S proteasome-associated pad1 homolog
hemoglobin; gamma G 

ESTs; Weakly similar to KIAAOegi protein [H^ 
ESTs; Weakly similar to KIAA0970 protein [H^ 



107444 W28391 

104525 R16007 

128917 AA204a76 

102299 U32907 



HS.S181 
Hs.75355 
Hs.206097 
Hs,155545 



ESTs 

proliferation-associated 2G4; 38kD
ubiquitin-conjugating enzyme E2N (homologous
oncogene TC21

37 kDa leucine-rich repeat (LRR) protein
activator of S phi " 




100398 
130800 
114481 
113404 
100260 



128820 
124049 
128781 
121361 
134133 



Hs.19574 
Hs.151572 
Hs.70337 
Hs.174135 
Hs.150402 
HS.2978C 
Hs.22744 
Hs^2153 
HSJ2094 
HSJ4727 
Hs.30558 
Hs.260g0 



^^ 106309 
ii 74519 
Hs 105466 



ESTs; Moderately similar to similar to NEDD-4
ESTs; Weakly similar to partial CDS [C.elegan
KIAA0942 protein

ESTs; Weakly similar to 60S RIBOSOMAL PROTEIN

minichromosome maintenance deficient (mis5; S



ESTs; Weakly similar to DMR-N9 PROTEIN [H.sap

KIAA0957 protein

homeo box D3

ESTs 

ESTs; Weakly similar to similar to the beta t 

Friend of GATA2 

primase; polypeptide 2A (58kD)

small nuclear ribonucleoprotein polypeptide F

ESTs 

dual specificity phosphatase 6 
small acidic protein 

ESTs; Weakly similar to Weak similarity to Ye 



131075 
133576 
100420 
130743 
122465 
115117 
124582 



130512 
134402 
104769 
125787



Homo sapiens clone 23570 mRNA sequence
ys69e12.r1 Soares retina N2b4HR Homo sapiens
secretory granule; neuroendocrine protein 1 (



Hs.99153 
Hs.49007 
Hs.108408



Hs.51779 
Hs.50915 
Hs.181271
HS.B2712 



Homo sapiens mRNA; cDNA DKFZp564F093 (from c 
ESTs; Highly similar to CGI-73 protein [H.sap
poly(A) polymerase 

ESTs; Highly similar to CGI-78 protein [H.sap 
ESTs; Highly similar to CGI-53 protein [H.sap 

ESTs 

ESTs; Weakly similar to putative zinc finger



ESTs; Highly similar to CGI-120 protein [H.sa



TABLE 7B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
101335 46413.1 L49054 N87447 AA24B791 AA452193 AI015525 AI762070 AA781526 AW183498 AA625682 AI268713 AA400391 AI193725 AW590304

W56360AA936067AI990398AA406183AA628888AA844206AA621117A1141092A1808150BE379750AI351482AA093527AA405119 
AA40()662AVV368723AA463811 AW242927 R50034 W56334F21257AA164314BE074125AA47()924A1307439W16738AA026647 
T35999 T19178 AA164313 AI744010 AI015466 AI014921

100906 4312.1 AU076916 BE29B110 AW239395 AW672700 NM_003875 U10860 AW651755 BE297958 C03806 AI796876 AA644165 T36030 AW392852 

AA446421 AW881 866 AI46942B BE548103 T96204 R94457 N78225 AI564549 AW004984 AW7B0423 AW675448 AW087890 AA971454 
AA305698AA879433AA535C69AI394371AA928053A1378367 N59764AI3e4000AI431285T81090AW674657AW674987AA897396 
AW673412 BE063176 AW6744C8 AI20201 1 R00723 A1753769 A1450161 AW079585 AW275744 AI873729 D25791 BE537646 T81 139 
R00722

102221 3861J NM_006769 U24576 AW161961 AW160473 AW160465 AW160472 AW161069 AI824831 AW162635 AI990356 AW162477 AW162571 

AI520836 AW152352 AW162351 AW162752 AI962216 A1537346 AA663902 H 17667 BE046345 BE669B02 BE255391 AA986217 AA236061 
AI129757 AW356451 T34489 D561C6 D56351 AI936579 AW023219 AW689335 AW889120 AWB89232 AW889175 BE093702 AW889349 
AA147546AI952998AA912579AI143366AW902211R64717AW157236AI815242 D45274AW263991 AA442920AA129965AL035713 
15 AI923255AI949082A1142826AI684160A1701987AI678964AI827349BE463635AW628092AW302281 AA493203BE348856BE536419 

AW193969 AW673561 AW592609 A1224044 H43943 AA091912 R49632 R48353 A1568409 R48256 A1198045 H27986 H43899 AI678759 
AI680310AI624220 H17052AA156410N56062A1699430AA664529T09406T10469AA627506AI379584 N83831 N88633 AW022651 
AA971281 AA248036 AI039197 AI914689 AA973825 AL047305 AA129966 A1798369 AW264348 Al 445879 AI658769 N57924 AI933507 
A1216121 AI333174 T10972 AI375028 AI186756 A1273778 AA610487 AI797946 AA853903 AA903939 AI338587 A1278494 AW627695 
AA904019

101714 30725J M68874AL022147 M72393AL049797BE439441 T27650 AI766240AW150345AW778943AI627464 BE439479AA587049AI277900 

AI984983AI 530935 

116803 55078_4 H47357W33034H55976H55975 R67830AA527091 F24482AW841585 R66514 

116988 185904 1 AW953679 AW953530 AA244436 H82527 AA361046AA244483 H82526 

25 132640 179_1 AW162087AA22453SAA471218AA08a655AA375275SE440052AF090891 AA324435AF063549A1110675AA322223AW953306 

AA233590 AW949864 AW949859 AA383721 AA081878 U33821 NM_006024 AA350900 AA081588 AI148087 AF268075 AA088185 
AI142478 AA081824 AI887930 AA070570 BE135248 AI459825 BE257794 AA420459 AA420859 AA777997 AA081219 AW816721 
AW854758 AA157932 BE018208 AW378974 AL041212 A1247664 AW681897 AI002897 BE643242 A181 1690 AW86207S AW852270 
AA360969 AAC94943AA09068C AW6C1SS4AA099673AA662226AA3S6814AA330174 AA187544C02751 AA315460 BE1683S8 

30 AWC80447AI813764AI624222AW156901A1954032AW473780AI861975AA173643AWS11541AI951492BE3016B6AA669760 

BE182212 AAOBIOOg T69431 AI186207 AA604124 AA707346 AA173953 A101 6700 AI125916 AA35B9B2 AIB73719 T90593 T90497 T10776 
AWS13002AW304292AA72488SAW4747S9AI811621AW068925AA66630SAIS80161AI128023AW471151AA534849AA6663S8 
A1078833 AI139223 A1244874 AI381658 AW263441 AI432440 AW802882 N66401 AA224251 A1167469A1141060AJ099214AI537130 
AL120428 AA948655 D53110 AA076099 AA938617 AA826543 AI3S7914 AA565098 AA807994 AI288812 AA632832 AA157933 AA639302 

35 AA634268AA282337AA551431 AA557374AA256923AA872943 AA009665 H8962eAA810386 T92925T36145AA632190AA130436 

AI686635AA130437AW392gC4AW3g2839AW392848AW392836AA72g737AA070450AW392890 W04825AA77184eAA084634 
AA481985 AI263840 A1801C06 AA235380 A1954229 AI559330 AI208724 AA887638 T25894 AA041269 W44443 AI581 770 W46171 
AA378485 W46535 AA1 97336 AA894945 AA394224 AI766834 AI582590 AI033007 AA481889 AW190598 AW392855 R27279 AA398137 
AI248407AI241386 A1991753 AI826685AA865699 AI096806AI833030AA041279 AW888745A1703279 N70572Aigi2553BE649931 

40 AI240422 AW375187 AW591692 AA975905 AW614967 AA009666 W44332 AA664659 T06561 BE468150 AI650695 AA5e7920 AI473310 

AIG32991 AA255499 AW104241 BE163782 A1984973 BE163613 A1263906 AA628191 AA282072 BE163769 BE163775 AI492939 AI473315 
D56907 AA6E7930 H89480 AI362373 AA598483 DSB595 AI167590 C16223 AI935415 D62655 D62884 D63130 AI760286 AI660286 
AW173598 AI499145 All 22566 AW903408 AI810569 AA854936 BE049510 D62065 D61900 D62101 R27475 AI469835 AI669086 Ne0399 
N48922 N48746AA481381 R2285B H13912AC004549AW602500 AW768788 

45 103427 43892.1 BE514383 M071273 AW2479B7 AW573236 BE312102 AW749B24 BE071985 AW577383 BE071946 BE072005 AW577355 BE071965 

AW23 9 231 BE072000 BE071 960 AW577360 AW749830 AW373020 X97303 AW999522 BE000192 BE562219 BE266655 BE264970 
103631 15?.34 R6473a AF214731 T19173 BE25831 8 AF161446 BE542228 BE383856 BE206748 BE543260 AA640735 AA788907 BE251313 BE221852 

AW865357 AA224407 AW855346 BE1 50454 AW070651 BE326867 AW051698 AI829278 AI470927 AW855345 AI804942 AI971 004 
BE046620 AI863664 AA8a8492 AI915971 BE046949 AW59071 1 AI468066 BE409685 AA332663 BE385394 AA862623 BE255591 

50 BE254968 AA211871 BE255493 BE257727 BE255389 BE257491 BE262528 BE261296 BE313277 BE261714 BE314316 Z28434 AA316645 

BE008562BE012093BE161393T31969AA305848AW955238BE619156AI191748AA323396AW361760AA2161ieBE264939AA325954 
AW580281 AA302597 AW888908 AW888893 BE312970 AA134402 H52679 AA478191 T34090 AW961505 Z24771 AA179552 R57244 
BE315207 AW583121 A1372747 T33143 AVV377460 T33141 R14922 AW352414 H93249 AW4C5S76 T33102 R89545 N46625 H0e434 
BE165062AW367891 H93121 H47325 T30931 AW402852 H47410Z2036BT18928 T30758 H93254AW389725 R96628A1372407 R88995 

55 AI81S980 AW1S7278 AW607664 AW163288 AA133492 AA099328 AA1 57348 AI816063 AW4495S6 AA1S7252 AW608980 H66576 

AW821 127 T32030 AW856058 AA032188 Z421 20 R18582 AW402392 BE408021 AA2B0989 AA039427 AA035354 AW328008 T941 86 
R97481AA181444AA774697BE613141AW630221H13066AI124S78AW7S4481 BE262112AW839942 H60108AW364002AW363800 
BE547161 BE082634AA642471 BE619719BE082719 W28879AW794944 C01685A1291127AW166099AI936102AI478929 R70284 
AA872914 W31065 N54215AI55874tH56262NM_017425 Z48570Al831777 T75007AA354867AA427938Al922e44AA733170 

60 AW821145BE081547AW881571 AW881573AW056249AA204724AI417415AI127303AI355013A1039527AW593269AA576745 

AI457317AW693235 H93126BE396072AL134941AW629175AI424011AA115732AA179986AI334944AW367922AW152304M806752 
AI312418 AW935023 BE301 136 AA032258 AI829922 A1372406 BE177074 AW513743 A1151526 AA975643 AA478034 AI814920 AW080063 
AI032624 BE177107 AA319768 AW935098 A1017620 AA974477 D51441 C14226 AL043683 080146 AI690771 AW00971 1 AW881570 
AI220431 N51090 AI143003AA951480 AA039351 AI094385AI096520AA179553AAra3974 AI373929 AA677252AA587374AA886867 

65 AA312863 AI150654 AI13845C AA133209 H99368 AI585632 AW070496 AI539748 H5945S AI811537 H52680 T74907 AI499557 R96670 

AA854795 AA427863 AA224345 AA8898gg AI347782 AA931056 A107S059 A1360841 W797976 AI362268 A1200968 BE360785 R97433 
N98499 AA1 34403 AA035355 AW263162 A1369607 D801 44 AI376627 A1520801 AA365942 A1707705 Al 123495 T33101 H0B716 AA804238 
AA922201 AA723522 AW183592 A1445B84 F34614 AW022342 AA363998 AA568793 AA152475 D31233 AA852622 AA099862 AI129147 

_^ AAg22699AA782664 n3142 730009 T32913A1676138A1914667 N34899AI372746AI266911A1362444AA443158AA910603AI420273 

70 AA868050 AI277700C14224 AW082087 R41447 138385 AI911645 A1961833 R91976 F04560 AA661955 A1857675 AA369666AA424207 

N79953 AA382958 AA894626 AI834964 AA8469B9 AA21 5454 A1742580 AI339437 AI806879 AI091373 AA782558 A1026868 AW590904 
AW204599 BE348235 A1819318 AA122324 AA939221 AW13971 1 AA131608 AW613548 AA122286 AI309179 AA437247 AW339322 
AI671306 AW439848 AA131701 A1078075 N54624 AAei2881 AI140547 

_ _ 129097 25953.1 BE243933 AA355449 T29766 F08395 N83324 NW_005953 S50223 AI207548 AA258C92 AA113952 AI311718 AI128612 AW607449 M77172 

75 AI961311 X52346 AA903307 A1569810 N55421 W77875 R37223 R83788 AA031666 H47092 AA133451 AA311095 AA906963 H87667 

N5605BAA393593 W24864H10710F06925F07239AW386140AA325018AA235950AW373176 N57158AA258093 N39467 R21609 
BE089979 R34173 AW889005 AA745644 AI693852 AA424914 AA744771 W72632 AI291213 AA524318 A1472134 AI91 1230 AA528418 
AA115745 AA775720 AI671134 AA975044 AW2981 17 AA321015 N26288 AW962194 A1743379 A1204233 AI801Q26 AA830690 AI146980 
AW10461 1 A1338576 R21507 AI367623 BE244484 AI269308 AA031667 AI884346 AA731989 AA988943 AA235951 AA807887 AA642645 

OO AI24648g N29739 AI216718 AI38334g AI03861B AI3S1476 AA806031 AI914178 H10711 AI095S73 H89220 AW470854 AA729015 R83353 

AA782239 R34295 H87165 AW419059 AI653689 Z40349 H89114 AW074506 AA397785 AA888377 AI91 1228 F03193 AI468783 AA702615 
AI83082g AA74B323 R37224 AA424915 AA731647 H47183 
120695 9683.3 AA976503AI917802AA953664 AA404613AA428771 BE280542AW194691 AI927301 A1740458A1796100Aig36603AW052210AA970201 

AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
100362 37780.1 AL133887 D64159 AF112218 AI766633 AL039303 AL133888 BE620604 AW976259 AW262792 AW591383 AI365413 N36652 AA807027

AI472041 BE620065 

101084 13883_1 AW409934AW245855AU077157AW163245 AW161434AW250083 AA316055BE621134AA171883 BE272494 105425 BE250310 

NM_013285 BE311494 AA858436 AA308223 AW362598 AA373618 BE394454 AA126101 AA581348 AA303227 AA058438 AA126544 
AL135350AW996947 M403201AA446682 W79685AW246249AW577783AW002316AA320026AW763913AI798554AW070650 
BE250413 AW250835 BE076336 A:925558 A1907634 AW804193 AW804270 AA902387 AW804232 AW804255 AW607751 AI9091 14 
AW157242AA934590A1628921A1470650AW409935AW172793AA401208AW162279AA838018BE206462AI826742AA857353 
AA483614 AA12S418 AA722289 AA780182 AW768894AW183614 AW155969 AI244063 AA863491 AI376281 AA682490 AA846248 
AI474094AW246802AA446557AA1260QOA1699045AI702310A1253092AA171554AA831455AW1183B4AI954511 AI760439AI867001 
AA493881 W81287 AA51 5590 AA659297 AA635139 AA659293 AA755044 AA196109 N32569 A1907635 

100502 26409 1 U15979 X17544 W52755 NM_003836 Z12172 AW37C136 BE262564 T49116 AA333753 BE262238 BE313737 H38163 AW683056 R28890 

BE259532 D16897 AA885610 AA911293 AA319627 R94472 R29022 AA443405 R96397 W04904 W01746 W01204 N74203 N58621 
AA701996AW418723 N53220AA602813AA576129AA593786AA911577AA576967AI149135AW673058AA772985AI188918AI372065 
AA575838 W60010AI004576AI131265AA319845 T50070AI335742AA235245 W32706AA447372 R95355N69573AA904616AI291224 
BE467454 T491 17 A1268620 AA928248 AA449494 AA318817 T49929 R94473 H38154 AI076649 AW935307 AW6051 12 AW935433 
AW935342AW935310 AW93S345 AI298308AWg35395AW935384AI184857 AA319871 729465 C21134Z1978S AA3291C7 T52079 
AW935346 C06234 Aigsi555 749928 AA371 745 AA389296 AA346673 R82547 750006 

102398 enlrez^U42359 U42359 

Table 8A lists about 54 genes up-regulated in ovarian cancer compared to normal adult tissues. These were selected from 35403 probesets on the Affymetrix/Eos-Hu01
GeneChip array such that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 4.0. The "average" ovarian cancer level was set to
the 3rd highest amongst various ovarian cancers. The "average" normal adult tissue level was set to the 4th highest amongst various non-malignant tissues. In order to remove
gene-specific background levels of non-specific hybridization, the 15th percentile value amongst the non-malignant tissues was subtracted from both the numerator and the
denominator before the ratio was evaluated.
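
The selection criterion described for Table 8A can be illustrated with a short sketch. The Python below is not part of the original disclosure: the function names, the nearest-rank definition of the percentile, and the treatment of a non-positive denominator are assumptions made only for illustration, since the text does not specify them.

```python
import math


def kth_highest(values, k):
    """Return the k-th highest value (k=1 is the maximum).

    Raises IndexError if fewer than k values are supplied.
    """
    return sorted(values, reverse=True)[k - 1]


def percentile_nearest_rank(values, pct):
    """Nearest-rank percentile (an assumed definition; the text names none)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100.0 * len(ordered)))
    return ordered[rank - 1]


def tumor_to_normal_ratio(tumor, normal, tumor_rank=3, normal_rank=4,
                          background_pct=15):
    """Background-corrected ratio of "average" tumor to "average" normal level.

    tumor:  expression levels of one probeset across ovarian cancer samples
    normal: expression levels of the same probeset across non-malignant tissues
    """
    # Background: 15th percentile amongst the non-malignant tissues.
    background = percentile_nearest_rank(normal, background_pct)
    # "Average" levels: 3rd highest tumor value, 4th highest normal value,
    # each with the background subtracted.
    numerator = kth_highest(tumor, tumor_rank) - background
    denominator = kth_highest(normal, normal_rank) - background
    if denominator <= 0:
        # Assumption: treat a vanishing normal signal as an unbounded ratio
        # when the tumor signal is positive, and as zero otherwise.
        return math.inf if numerator > 0 else 0.0
    return numerator / denominator


def is_upregulated(tumor, normal, threshold=4.0):
    """True if the probeset meets the ratio threshold (4.0 for Table 8A)."""
    return tumor_to_normal_ratio(tumor, normal) >= threshold
```

With hypothetical tumor levels of 100, 90, 80, 70, 60 and normal levels of 30, 25, 20, 15 followed by six values of 10, the background is 10, the corrected numerator is 80 - 10 = 70, the corrected denominator is 15 - 10 = 5, and the probeset would pass the 4.0 threshold.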



130941 
101249 
132528 
102610 

129671 
105298 
121779 
104301 
132191 
102136 
101304 
132572 
106738 
108357 
115291 



AA283C06 

U65011 

AA347193 



AA448297 
AA470145 
AA1332S0 
AA279943 



Hs.1904 
Hs.50758 
Hs.30743 
Hs.62180 



Hs.237825 
Hs.25130 
Hs.62180 
Hs.122579 



5-hydroxytryptamine (serotonin) receptor 3A
protein kinase C; iota
chromosome-associated polypeptide C 
preferentially expressed antigen in melanoma 



KIAA0741 gene product 
acidic 82 kDa protein mRNA 
HK protein kinase 
signal recognition particle 72kD



134520 
108778 
131228 
116238 
108055 
132939 
115909 
120438 
123494 

109648 F046CC 



N21407 

AA128548 

AA279157 

AA479362 

AA043562 

U76189 



111234 
135242 
123006 
116296 
100661 
111346 



130376 
120619 
122802 
116416 
115094 



N69287 
M74093 
AA479726 



AA211941 

R40873 

AA2B4372 

AA460530 

AA609219 

AA255921 

AA947601 

AI160190 

AA257971 



Hs.90847 
Hs.24485 

Hs.47144 



AA243441 Hs.99488 ESTs; Weakly similar to ORF YKR074w [S.cerevi
AA599786 Hs.112110 ESTs 



Hs.63631 
Hs.21943 
Hs.9700 



Hs.14669 

H5.158174 

Hs.4029 

Hs.90073 

Hs.10g643 

HS.15S174 

Hs.111471 

t1s.2S6579 

Hs.39982 

HS.8809S 

Hs.97056 



ESTs 

zinc finger protein 184 (Kruppel-like)
glioma-amplified sequence-41
chromosome segregation 1 (yeast homolog)-like
polyadenylate binding protein-interacting pro
KIAA0432 gene product 



hect (homologous to the E6-AP (UBE3A) carboxy






114965 AA250737 Hs.72472 ESTs 

120821 AA347419 Hs.96870 Homo sapiens mRNA full length insert cDNA clo

134621 L02547 Hs.172865 cleavage stimulation factor; 3' pre-RNA; subu

134161 U97188 Hs.79440 IGF-II mRNA-binding protein 3

TABLE 8B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number



CAT Number Accession 



116416 
132191 



2774.1 
47548.1 



L18964 m 002740 L33881 AA095249 BE080871 AW60S320 1^^85571 AA17977e AA160650 AW1 17327 BE467131 AW088338 AW937631 
AW0B7S14 AI480090 AI873147 T57875 AI217404 AA827196 AI27g471 AA969093 AA815168 AA988896 AI754623 T28044 Am50302 
AW950294 AI0321 93 AI953e96 AI630SB3 AA062633 BE541 355 AA18C493 AVW1 5748 AA255651 
BE623001 L05096 AA383504 AW95541 6 N53295 AA460213 AW571519 AA603655 

AW893940AW978851 AA03424OAI686323AI767653AA829515AA053933AA737691W92607AW261869AA835698AA447216AI623248 

Z21891 AA835700AA599963 T20152AA533167 

AW753676 Rl 1789 AW001886 AA609219 AW780420 AI860557 AI280331 AI334300 AI288870 AA669343 N29918 BE537790 AA934687 
H79075 N42970 R63752 

AA507576 AI610269 AI380079 R40309 AI203932 AI3421 28 AI342578 R4311 0 AW583269 AI375234 AI092708 R52802 AI028462 AI016062 
AI189 144 AI016691 W45615 AA551452 AA449431 T1 0046 AA424059 N62822 AW197701 AA465242 A1418989 AI942433 A!891 1 1 5 BE302315 
AI743979 AI283341 AW34a338 AA774543 AW104778 AI078020 N21 487 H97562 AA970063 BE327945 F03880 F03885 AA970899 AI298468 
Ai380330 A1247787 AA770467 AI200154 Ai089863 AI089890 AI595738 W88524 AI471010 AA700191 AA778937 BE440182 R79225 AA338236 
AA548984 AA907692 N21250 AW904736 AI909337 AASe7772 AWg69228 A1149372 N29644 AI039967 AA677529 A1694291 R86811 N28672 
AA465598 AA321185 AW130492 AI824479 AI682992 

NM.000869 D49394 BE252349 AW249320 AW249140 AW250535 S82612 AJ003079 AJ005205 AW178407 AA811360 AW976407 AW976408 
AW248903 AA731733 AA804189 AA703169 AI435492 A1076288 AA91 217S AW248713 AA743457 R08170 C06167 R02351 
AW872527AA453863AA442475AF086541AA365801 A1692575AW131631AA732993 W96131AA436666AA453779AA365504AW959717 



101804 
132572



AF133123 m 012086 AA1 28292 381493 AL137453 BE614053 AA307628 BE009521 BE085659 BE085542 BE0e559e AL120654 R13165 
AA429306 R13465 R55236 AW994182 W00838 AW994417 AW994404 AW994426 AW994321 AA516147 AA345603 AW953009 BE315104 
AI126654AA626457 AA291327 H67g83 H66271 H67976AW270955 AA768221 AI023487AI921811 AI963370 AF085850 R70992 N26129 
AW295143 AI433661 AW608361 AA873402A1217453AI953358 AA262143AA92849S AI475268AI167211 AW385961 AA269045AI762630 
AA428238 AI001932 AI735550 AI951370 AA766807 S81492 AA918976 AI040967 R70939 AA469065 T70340 AA477615 AA478070 AI017743 
AI608833AI635824AI186039AA741312AI040184 H67656AA258221AA731316AI381293AW298473 R55237 R37375AI768014AA128S48 
AI206773AIB79827 R64193 

AA300576 U15552 NM_014597 AA223318 AA171806 BE259461 AW578439 AW604388 AW953513 AA772816 AVW04383 AW577851 
BE169672 AW117711 AW3663C3 AW3663C2 AW36630B AW366304 AW366300 AI908432 AW591937 AI358420 AW272622 C75067 AI926471 
AW002266 BE064947 BE064722 T10372 AW838681 AI81 1 1 19 AW262098 AA58B547 AI916666 AI440083 AI0781S0 F2426C AA51291 9 

AI953413 A1054798 A1420425 AA191 324 BE503222 AI632721 AA1 80035 AA558329 W44843 T10610 W38442 BE542B69 AI125024 BE279566 
AW747936 AI589491 AA559096 AI090265 AA548959 AA223220 AW51 5936 AW368395 AW368407 BE540776 AI039762 AI584020 AA171691 
AK001468 AA190315 AA374980 AW961 179 AA307782 AA315295 AA347194 AW953073 AW36ai90 AW36ai92 AA280772 AA251247 N85676 

61R57094A1550045AA347193R16712AW119006N55905 N87768AW900167A1341261A1818674 D202B5 

26 Al 1 22827 AA1 33250 AI952488 AA970372 AA889e45 AW069517 AI5243B5 AA190314 AI673359 AA971 105 
AI361088 Ai872789 AI919056 AI61 1216 AK001472 BE566751 AA6810C4 

AL043202 U33286 NM_001 316 AF053641 AL048759 N99830 AA263091 AW408174 N90467 R84306 AA317882 BE613644 AA307378 T10722 
AA207207 AA315560 AA1 13938 AW386317 AW386316 L44546 AW386335 AA243317 AA713588 AA192541 AA649035 BE300737 AW752491 
AWg02334 AWg93922 BE003403 AA251 521 AA382754 AA339162 AA382619 H68600 H67810 T70379 T82109 D81644 D60375 H59003 
BE075732 AA471242 H17790 F1 1801 T84903 R78076 BEo14356 R16380 R16396 AA876127 W95535 AA164768 AI279876 H02142 C18698 
AA365866 AW954410 AI539769 T39128 AL121103 AA192466AA213367 AI963800 BE090601 220096 BE566508AI969470BE044090 TS5536 
AA837311 AA075484 AA075621 AA778294AA587266 T69722 AA4461 18 X86624 AI334209AI587101 Ai281280AA«8602 AA946837 C75603 
AA236997AA459274AI150191 AA165156 A1198839AA789268AI139373AA236674 AI127770 AA678954AI140786AA113939AI187231 
AI754062AI763243AA934719AW439362 H02038 C17453 A1400951 AA227539 N66040 RB9384AA872668 AI344110 W95420AA164700 
C05659 BE094097 AI826398 H58956 T17222 AW139044 A1271344T16445 R42323 C76665 AA166228 A1026443 AA165229 C21496 AI826239 
AI868711 AA532354 AA524392 R01549 R01641 221083 AA528463T39127AA989472 F09450 AA084485 BE004378AW974353 AA137250 
AI278406AW609291AA137249AA142866AA639198AW609271AW149760A!026112AA236620AA937248 U90736AW005487AW674427 
BE397971 AW609285 T65602 T99684 T97378 AW609366 T85647 AI672235 T99083 AA199583 AW303874 T35523 AA686446 239669 
AA4S9503 N95643 AW821210 AW813461 AW582064 AW609293 AW609320 AW609270 AW58208S AW582071 AW609318 AW813451 
AW8134S6AWS82079AW60SZ76AVV6092BOAVV6092gOAW582101AVV582102AW609263AW609317AW6092S6AW609305AW 
F06655 AW605343 AA446426 BE09059S AW969S7e T79852 AI0e2S0S N63239 AI97316e AIC861B2 AA84671 1 AI874213 AA730605 AI927257 
AA912624 D60376 T10180 AA705847 AI018123 AA493197 T67083 R77739 AA9530B7 R00885 AI370606 R01642 AA862914 N57843 
AW023353 H774B3 H68082 R42337 H58601 T97267 

H17800 Ai362549 Ai671 064 T23526 F03426 F04694 F04600 A1635B56 238715 F02039 AW022635 

T78736AA284422AA283006 

M86699NM.003318AL133475AA122377 221415 R57092AA806568AAB11904BE538323 R41558AA421620AI337292AA470077 
AW795371 AA543024AI677941 AI4722CC AI215042AA732384AAB37143AA8C4229AI907533AI742701 AA1211S9A1973225AI620839 
A1929669 AA227827 AF069765 AW408768 NM.006947 AF077019 AA220974 H07969 C14621 D52294 BE512960BE614138 BE258539 
BE251981 AA355433 AA481126 AW403053 BE542282Ai929818 AL120605 AW753079AW391834 BE0186C3 BE395262 W21406 AW66325g 
AW975690 W93905 W96519 AI863832 AA443177 AA730942 T99558 TB6581 W19444 N555B3 AI701020 AI92B986 AI857864 AI590849 
AW081819AA714970AI122630 W04887AW662427AA602680 W93645AA582946AVW)08812AA311187AA463631 A1421918 A1400518 
A1921 404 AA143770 AA587675 BE302192 AA813080 A1493386 AW327435 AW340871 AI143616 AA687231 AA218961 AI362249 AI378345 
N74716 AW969249AA468581 AA516399AI274726 AI131244AI572604AI929236 AW327971 R65637 N90309 H07877 W96486 AI358806 
T90801 AI383246 AI740957 T86758 AI471248 AI864233 AA910590 AI079094 AW805781 AA709025 AW196707 AW327436 AI903790 
A1873956 T99348 AI924643 AW103910 AI802993 AI080390 T99098 D19794 AW327972 A1935904 A1288676 AW3B0875 AA779784 N93674 
AW769295 W32639 AA363094 N89012 W39751 AI291329 A1291371 AA82941 1 AI985219 AI422776 AA918940 AA363108 AA192633 
AF0861 31 AA373679 AA165043 AA355705 AI243507 AI027796 AA573461 AA757260 AI370979 AA5741 49 AA658276 N70660 AI478948 
R35393 AA448435 AA334659 AW879356 AA436527 AW972044 W25165 AA521219 AI094141 AI302096 AWy8551 AW578534 AW390535 
AI131 472 N50381 AA736938 AIC891 12 AA863053 AI359793 AA962268 T27353 D82590 AA448297 A1277168 AI36a4S7 AA872737 AA330346 
AA308346AA342341 AA365159T86701 BE1 62693 T99703 

AA503020 AI858190 AI686571 AW615203 AW073686 AW172459 AIB28762 AW150534 Ai859795 AM1 1046 AI539195 AA404fl09 AI63B559 
AA434329AA171844A1684143AA953518AW470108A1870700AA706376AI539668A1683712AA075579A1682137AA291512AA554431 
H51315 AA404225AA075632AA172293 H51911 

AA326108 W74020AW612698AI750909AA487800AJ270e9SAA044941 H20708 AA2967S0 AA018401 AA378SB1 AW964159AA018887 
R68533 AA525338 AA526640 H84308 AA278942 AA164818 AA3471 10 T82335 N25519 AA021474 N31381 N36297 AA83B191 AA318932 




M9612a6 N41430N41439AW63M77W37595BE394538AA365256NM7771 N34873 AA988105A1242138 AW148523 W978761 N50882 
AA52744a AW086200 AI75091 0 N50868 AA709437 N51946 AI222179 AA732883 H96742 AW615360 N537a) W37490 R87362 AA51 3273 
H98999 AI469022 AI368442 AI460122 N20485 N24087 AA164819 N2487B AW471270AW590458 R68240 AAffl4434 N204flfl AI419625 
AW500664AI033658AA593215AA907408AA71350BAI422627 H85551 AA923571 062680 AW627456 H96206AA01 6289 AA485896 N266gi 
132632 4312_1 AU076916BE298110AW239395AW672700NM_003875U10860AW651755BE297958 C03806AI795876AA544165 T36030AW392852
AA446421 AW881866 AI469428 BES48103 T96204 R94457 N7822S AIS64S49 AVU0049B4 AW780423 AW675448 AW087890 AA9714S4 
AA305698 AA879433 AA536089 AI394371 AA928(IS3 AI37B367 N59764 AI364000 At431 285 T81090 AW674657 AW6749B7 AA897396 
AW673412BE06317S AW67440BAI202011 R00723AI753769AI460161 AW079S8S AW275744AI873729 D2S791 BES37646 781139 R00722 
102610 9336.1 U6S011 NM.OC611SAW1820S3BE383g3CBE407839BE409930BE408826AW370292AA3128S9AAl36204AW36S8S2AW36S735 

BE622732AVV9392g5AA781195AI017284AW376329AVV3753BBAW17B384AW78333AW178424AW365726AF025440AW172852
AI570998AW117792AI885499BE46SS16BE207427AW130942AWS13316AW770892BE207426AW1 73563 AW168292AW173S6S 
AI810101 AI744983 AI861974 BE207404 AI744982 AI613210 AW591505 AW1 89285 AI521444 AI745C44 AI627904 AI690634 AI289305 
AA861253AI612799BE207425AI14ge94AA902662AI082468AI014752AA613844AA725693AA136C8gAI2g0C82AA565489AI68g083 
AI859014AW051225AA66575BAA496ggiAA564738T19428AI567170AW166726AW084200AW188723AA617626AI91B664AW381473 

AW381543 AA598817 AW088942 AW050423 AI382502 AI680308 AI687500 BE327836 D20455 BE410282 BE254786 BE256014 AA357423

BE6182C8 AA489S77 AW1821 14 BE37gi47 
102627 25245 1 AL02igi8AA160639 U665B1AA321623US2098AL1ig4S3AA455712N80080 N46550 W07223 N75923 W05057AI811577AA455657 

AI275409 AI139121 A|g27S68 Aig275B2 AI139471 AA16Q473 N7879S AI7ig983 AI718928 AA7230g7 AI335776 M39140 N591B4 AI5B7600 
AI864B12 AA732097 N74667 AAB3239B H89600 019825 AI554833 

132725 29101.1 NM_006276NM.006276 L41887L22253BE379909BE567870BE274265 BE53951BAW239523AW239271 AI093618BE5C4485AA580279
AA494481 BE440161 AW7B0428 BES439B0 055986 AA852399 A1630020 W77G96 M2781 93 R10505 AI963201 AI739336 BE174301 
AA662222AA6B4912AA244152AW611553BE503285AA211023 M3e3016AI69B174AW195381AA948229Al768495AIB90437 N30025 
AI71B952 AI953572 BE464509AA777315 AI337221 AW070910 AI953848 AW674661 H54177 AW510B90AW07B699 AI436178 AA630759 
BE502074 AA278769 AI499038 AW469072 AA778071 AW236753 AI933033 A 690458 A1276691 AW768235 AI952118 AA425156 AA610579 

W73953 AA244153 W86034 AI948872 AI952678 AW087811 AI333591 AI869883 AI926911 W48865 AL048024 AA214485 AI972522 AI151368

W48738 AA214467 AA334640 A1678170 AA927525 AA68158B W96283 AA366470 A1471919 AW6114B8 AA211834 A136519B AI698365 
AW002238AA507624 W96150AA446490AL048025AA852400AA362221 A1336376 R35083 A^290B12 R10397 AA9759BB AW236462 Z43032 
H1 6969 F13437 D19858 AA452207 BE085942 AA344396 AW949533 AA279472 AW902406 AW070440 BE3951 95 H00835 AA300750 
AA729303 AA420591 AA385025 AA420542 R691 55 AA4205g2 AA2B1 747 N88502 AI46B206 AI700996 AW41 8607 AW341 202 AI825692 

W00640 AA214405 AA044744 AI950617 BE467493 AW474113 AA446310 BE328705 AI911573 AW243968 AI628622 AW173020 AW079958

AI1403B7AW051969AW299438AI127170AW76gi64AI422435AI307116BE549519At371116AA281748AA701073AA67gg4BBE551197 
N50345 AW338776 BE32SB01 AI142892 AW470687 AISSgSBB AA911241 AW294822 AI174414 AA804366 AI004725 AW27igg4 AI559313 
A1270102A1351542AW768904AA7659B4A1961708AI149231AA995907AI094280AI185753 H01219AW768846AA747500AA970106 
AI60123BAA513452AW612802BE075163 R39171 AI565328AI375669 F10366 AA284B25BE241509 AI702889AW193010AA649847 

AW439150 AA721407 AA810333 AA706384 AK)49B87 BE569015 BE622280 BE566618 AW967342 R69269

111234 83711J AA902656AI185915R43705H15150H09794AA8324S4AI697438AI354538AI436354AA948272AAg28143AI091263 R416SBAI3525B0 
All 2294B AA946670 AI340088 AI275007 N70255 AA721 176 AI934162 AAB27098 AA935934 AA827088 AI081207 AA992399 AW1 30757 
AI805667 AA035556 AI37g266 AI0939Q1 AI095234 AA909Q79 AA516079 AI572357 AA20596g AI432383 AA905290 AW628920 AW1 82996 
AI266084 N49879AW024457 AI246246AI934031 AI369270 AI003836 AA010063 AA494361 AI284151 AI919536 N34884 N69287AW510465 

AI358609 AW081421 AA706205 AI085317 AI140633 AI347104 AA602547 AI686707 AA872536 AA694028 AI094546 Z40832 AI382838

AA610132 AA501433 H841 20 AI140722 AW674839 BE503822 AW663895 BE327472 AW393494 AI340087 W04189 AW393499 H56506 
BE089878 BE301950 AI025475AA724446BE275324 H15210AW957667 
134161 16074.1 AA634543 AI682259 AF1 17108 BE39e917 NM.006547 U97188 U76705 BE560799 BE396918 BE269531 BE560268 BE560346 AAB36048 

AL023775 BE545535AA427803C18804 D58801AA303353 U46218BE539704AA187966AA252545AA261821 D63197AI824109 AI088047 

AI424833 AI807368 AI250857 AA741476 AI146832 AA169615 AI809821 AI274288 AW136704 AI206172 AA917039 AA243584 AI808611

AW674709 AA935733 AW450092 AA905172 AA471196 AA302256 AW673348 A1352044 AW511295 AA247134 W81035 AA722962 AW662471 
R64432 AW04461 6 AI086619 AW628546 AW043682 AA425750 AI743038 A1368723 AA187143 AI3769B7 A1B03976 AI275537 AW471368 
AW104877 AA1 95464 W81 072 AW197351 AA932674 AI3S3420 AI43499B R63822 AW086083 AI240272 W87006 AA01 1347 H58428 A1497896 
R23223 

132939 11867 1 AB009284 AF000416 AA022636 BE032610 AW357997 AA491410 AA337477 AA336421 W38526 AA625283AA773685AA490078T66134
AA347838AA022647AW054725AI918001 AA431965 AI263596 AI804298 AW469314 U76189 AA779001 AAS25945 ALD42357AI674730 
AA410350 AK001450 AV654353 AA058443 D8161 8 AA853555 W31930 AA334445 AW955767 N47777 AAB83784 AA428916 AI552062 
AA329703A1417923A1435031AA708791 A1989636AI220345AI239913AI220102A1435875AI076731 AI377049A1039173 AW972638 N90076 
AW263652BE440048BE440013AAW7463AI038774AW204992AA846580AA501952AA34205BAA508525D61670W31725AI689499 



AW263155 AA314512 AW40B152 AA360413 BE206274 AK001402 AA307B65 AW954678 W39078 AW36g236 AW3691 16 AW369096 
AW753235AW36g072AW361194AW369125AW3641B7H94225T79502AA131908BE071359AW36B503AWB01517AWB01322AI674163 
AAB61 077 AI738568 AI830199 AW1 18577 AI478895 AI688497 H92996 AW0B3479 AW970625 AW6131 24 AI632234 AI65421 0 AI696847 
AI923423AA13190gAA806B30AA806158AW592S20R9650gAAB13923AA502823AVW67889AA960972AAg945BBAI2B0346AA88458B 
AAB535S3 AI200023 N89820 AW7BB792 C00145 R96554 T79416 

AA76B491AA476251AAB09748AI186268AA621244AI37g029BE550341AA651915AI216376AI215585AI471780AA772159AW181980 
AI151169AA759270AI675769AI018776AA757335AA148511AI138378AA504167AI420617AW261930AW8727g7N51769AW614403 



AIB71154T16768AI765893AWg69016AA744720BE094085AA743769AI476407AA156619AI76B535 R81435 D45332 N75682 N51177 
AW207406 AM251B4 R20997 AA50416B Z43298 

BE091005BE541579AW387738AW3860B3H1376gAW377820AW3e9180AW753239AWB72695BE37g572AW021732AW891450 
AW891416 BE09135B N50376 BH)91354 BE091365 66091361 BE091363 BE091350 AA353863 AAB45510 N21407 AW7709B1 AI3B1577 
AA526557 AA525443 AW893622 AA63089B AI418983 BE172016 AA550754 AA664574 C16147 AA355902 AW9585B6 N51590 C20995 
BE5441 86 AI337578 AA090549 AI807374 AW450654 BE067578 AA446781 AA447058 T1 0807 AA457082 AI267703 AI880220 BE568979 
AW380506 BE150744 AW3B0468 AW380546 BE150713 D60029 HBB099 BE546301 BE160731 AW368467 BE091348 
AI583187 M73812 AW339829 M740g3 BE252510 BE2S2518 BE536901 U40788 W95578 6E018493 BE544205 N83637 AI671049 AW439693 
AW300786 AW374970 AA592960 AI215885 AI215884 BE302101 AI186210 AW771831 T54213 AW452924 AA834019 W95471 AA628312 
AW304866 AA570076 AI559B73 T54121 

AA037145 L02547 NliL001324 AW411516 AA314711 AA143605 BE394455 AA325731 AA093227 AA0B3307 BE299438 BE295669 AA37C886 
AA338272 AA166862 BE304837 BE298306 R60507 AW238966 N72750 AW505406 AW994153 AA309742 AA309929 BE0g0721 AW904189 
AW904214 AA363564 R94127 AA352101 R28249 AA206337 AW577208 AA3B5473 AA35S463 AA4fl0696 AA0755B7 W72815 AA554033 
AA075530 AA620555 AA554034 T27804 AW950014 AI570740 AI268538 AA704423 AW41 1517 AI278646 AW339924 AI66B917 AI796034 
AA994501 R94082 AW027137 AA400652 AW299746 W7ai6 AA988494 AA146582 AW0B7489 AA992763 AW516454 AA992759 AW270444 
AA227183 AI203929 AA1 67720 AI052527 MB66660 AA569368 AI888464 AI670003 AA827620 AA507543 A1BBB385 R8841B AW9590B3 
AI341077 AI825719 BE552285 AA738076 AW0e5903 R28242 

AW805610AW806603AW805500AW805819AW517040AW473670AW51670in0141AA894497BE349504AI272007A1985274BE501962 
AW102975 AI801727 AW197918 T24046 AA947601 AW900958 

AF121B56 BE242657 U83194 AA226732 A1160190 AA948725 A1079958 AW513369 W39443 AW408479 W06854 AA094683 AI985095 
AA316647 H96313 N7843B R81582 H95034 R79674 AA488552 W2S292 W31697 W19918 T30640 R08686 H78637AA1 66100 241909 
129097
120619
129229
120821
106459
115094
123005
108055
75529.1
100690.1

M165080 R34212 AA160886 T82168 N77082 W56864 R19848 AA888217 AA314539 AW750293 N62714 R58039 AA845453 N63268 W03474 
N41923 AI264123 AI808533 M824288 AW198143 N99916 AA902465 M775397 M772387 AI567676 AA227473 A1082614 R68334 N78878 
R34329 AW438902 AA164685 AA558790 A1590102 AA863422 BE002625 AA934039 AI298102 W15351 H93997AA725325.-eE1 80993 W05350 
AI510771 W06941 AA4B8414 R79863 N91264 R768B4 AA31294B T71267 AW959659 AI086695 N90421 AI278098 AA164538 AI300271 
AA854381 RB1331 AA700449 H06174 AW518427 AA876634 AA150778 N32393 H78585 T85364 BE002808 AW663196 N90337 BE252097 
T71401 H06438 N40268 N31015 R77046 T99588 T85462 W25298T59815 H09416 T85403 W32150 N79109 R78812 
BE387790 BE276B49 BE246825 BE246900 AA380437 AA332gg6 AW408727 AK000294 AI636887 AWig7272 AWS90657 AW594006 AI768979 
AI751632AA580098AA313261 AA300475AA133237AA233499 BE242126 BE242597 BE242254AA314374 AW961210AW939345AA173535 
AA306779 R99373 AA128304 AA447246 AA476365 T34973 BE264878 M25149 Z24939 BE263038 AA128317 Z46082 H23504 AW378551 
AA088533 AA442625 H15235 H79172 AA344697 AA344742 ALj040280 BE173256 BE173129 T59749 AAC8841Q BE242311 AA173576 
AA455337 AA129715AA582953 AW612637AA917038AW340019 AW006315AW593024 H05915 AW294592 AI146814AW195659AW440271 
AI209090 AI28B689 H98630 H15174 AI214454AI569471 AA085808AW188351 AI751631 AW440477 AW300860 AA516088AI365108 
AI305805 Al 264064 AI246276 AA44261 1 AA807059 AA233459 AA875987 AI374653 AA972665 AA9476 16 AA436867 AI216423 AI667181 
AI680306 AA436854 AI537153 AA883723 Z28659 AA705973 Z40741 AA463884 AI216025 AI564986 AA476316 AI432666 AI571662 AA447126 
AW293675 AI67551 7 AW009004 H23498 

AW015242AA831493AI292a46AI076966AI191561AA243441AI183309AA252613AI038422AI306531 AW051480AI348605AA195119 
AI817119 AI091896 AA738440 AA195013 AA976687 AA459659 AI246250 BE219252 AI703457 AA243291 AA243401 AA989100 AA931640 
AA459782 

AK001269 AL354613 AA147472 AA490803 BE207628 AW816113 AA085574 AW503392 AA299910 AW750305 BE079539 BEC79484 
BE512838AK001593AW968772AW967440AW206280AA251270A1627886AA303599AA147473 BE205515AA490611AA715039 
AW590866 AW590447 AI364512 AA204731 AA894490 BEOOi 136 AA612785 AA237036 AA149960 Z44267 R12986 AA448446 AI734041 
AA422167 BE22Q551 R66041 R32927 R32942 AA258773 AW386142 R53730 N54624 AWeeC29S AA253485 AW954441 H93989 AW614348 
AI654838 AA779793 AW237213 N66635 AI186812 AA947479 BE158011 AI859480 AW805579 N52010 AA806306 AI628445 AW270990 
AA778165 AA149949 A(55072B M74910B AA687257 AI261661 AA747442 AA481351 AA206339 AA903407 AW473306 AI688930 AA262281 
AA448310 AA74a820 AI347430 BE465692 R32839 AW510564 AA436408 AA257971 AA253362 AA938330 AA513150 AA976840 AA6871 17 
AI281547 AA046243 R32325 AI631664 AWiaSEie A!244535 R52946 AW236443 R40183 AA299909 AA81 1958 AI302918 Z40213 BE158047 
BE158060 AA767245 A\V7481 59 AW500735 AA094074 

BE243933AA366449 T29766 F08396 N83324NM_006963S50223AI207648AA258092AA113952AI311718AI128512AW507449M77172 
AI951311 X52346 AA903307 AI569810 N55421 W77876 R37223 R83788 AA031666 H47092 AA133451 AA311096AA906963 H87667 N56068 
AA393593 \/V24864 H10710 F0692S F0723g AW386140 AA325018 AA23S9S0 AW373176 NS71SB AA2S8093 N39467 R2160S BE089979 
R34173AW88900SAA74S644AI6938S2AA424914AA744771W72632AI291213AA524318AI472134AI911230AA528418AA11574S 
AA775720 A1671134 AA975044 AW298117 AA321015 N26288 AW952194 AI743379 AI204233 AI801026 AA830690 AI146980 AW104611 
AI338576 R21507 AI367623 BE244484 AI269308 AA031667 AI884346 AA731989 AA988943 AA235951 AA807887 AA642645 AI246489 
'N29739AI216718AI38334gAI03B618Al351476AA806031Al914178H10711A1095573H89220AVM70854AA729015RB3353AA782239 
R34295 H87165 AW419059 AI653689 Z40349 H891 14 AVW)74506 AA397785 AA8BB377 AI91 1228 F03193 AI468783 AA702615 AIB30829 
AA748323 R37224 AA424915 AA731647 H47183 

AW966339 AL045632 AA333229 AI306195 AA284372 AA206108 AA5B2633 AW44961 4 AA804785 AI215473 AI357263 AI651208 AI651753 
AF01 3758 NM_006451 AI538709 AA209236 AA300293 AA367274 AA1 26598 AA324825 AW955225 F1 1436 AW374740 AW374714 AW374774 
AW751614 W74780 AI909015 AW997a79 AW997067 AW379344 AW363397 W3B5B9 AA043823 BE169280 AI909016 AW994861 AI74C638 
AW14e560AW368339AI858333AA314718AW954872AW46B734AIS81980AW519046AW056171AA579286AW069164AW615004 
AA345052AI446735AI142106AA662683AW002ei3AI418280AW613203AIS13333AI354480AI929755 AI146977W74674 AW799610 
AI798529AI589422AA043967 AI223043 AA167016 AI446759 D56729 AI687471 F30716 AA812125 AI537301 AA653347 D11966 AI434383 
AA59B533AI287254AW139140AW051033AA601911AI702506AA737460 T30221AI129081 N90213 AA805225 AI798518 BE001071 T10841 
W20199 AW664694 AW196667 D60123 061496 AW468018 AI720097 N90553 AA829375 AW513266 H92758 AA585324 C14767 AI922391 
D60124 D60666 AW071558 BE044120 AA728821 AA211941 

Y19062 NM 014393 AW296801 AK001576AL079288W16667 Z45S54AI768661 AL079236 R12736 AW0e0147AW136530AI202958 
AW241579 R21013 AA347419 Aig29333 AW196689 AI040867 F13437 AA918240 AI869798 AI365176 AW440030 AW440C72 N80892 
AW242030 Z44307 R12417 AA436784 AA442041 AA046503 AL157526 AI929265 AA056642 AA045462 AA683542 N51374 AWig3508 
AI873524 AW473151 AW004719 AI810504 AI581093 AA493977 Z40600 F04553 R46130 F09321 

AA78g081 AW408328 NM_006530 U61 384 AA449641 AW13821 S AA448696 AJ246746 AI365301 N44728 AA255743 AA360783 BE550380 
AW593925 AI962309 AA322097 AW964625 A1695988AW672B27 BE543266 AK001413 AW503395 AA6S1700 AA449053 AA455540 
AW083185 T52128Z78373 AW673713 AW468061 BE350755 AW673958AW675504 AA995709AW574841 AA835883AI248439AA548364 
T62072 N331 93 AAB14046 AI376210 A1340020 AA449766 AA703407 AA427613 AI470108 AI298757 AA507602 A165B941 AA44g478 
AA633165 AA449741 AA831B21 AA903673 AA682588 AW673075 
AA2SS920 AI817197AA255g21 AI612925AW874669 AA4g3440 

XS1630 M80232 X61631 875264 AA172249 AA134065 AA130278 AA130187 AA130291 AA031554 AI246677 Z21455 AI745434 AW273S44 
AW088613 AW471307 AI7454a3 AI399854 AI683952 AA031555 AA298075 AI935945 T29809 AA172099 AA356120 
AW513143AA422036AI821669AW514232AA477828AW772009AW439799AW089884 

AW149266 R49248 AW237401 AA938113 AW665871 AI969698 AI960812 AW874276 AIS71939 AA741222 AIB6g822 AW104061 AI569994 
AW972559 AA506012 Al 553704 AA470145 AI33242t AA218990 AW131351 AI709076 AW392488 AW392477 AI970981 AW392476 D61949 
H44g81 BE172698 

AW369771 AW748174 AA290801 AA419198 AA044331 AA127909 AW995442 AI480343 AA044582 AW956159 AA373461 AA127965 

AL134913 AW994955 BE622314 BE006298 BE006312 BE006305 BE006317 BE006303 AA043906 AA234175 AA479726 

'AJ404672 AJ28g8ig AWg76000 AA043661 AW450885 AW452379 AA043662 AA788832 AI664338 AI564330 AI368875 AA643607 AA994375 

AA810342AI367704 

BE545072 AI540751 AA301103 AI916675 N85422 BE553965 AA327978 AIB16094 AK001515 BE501319 AA279943 BE138895 AA343765 
AW963051 AW082308 AI823g92 AI653752 AI589007 Ai816135 AI566636 BE601307 AW272766 AW242239 AA766315 AI014927 AA578848 
AI354483AI476548AI038579AA973322AA992180AW472921 BE5047B9 AI392988 AA606076AA769228 A1370562 AL137710 BE005656 
AW96S920 
R40873 

AK001468 AA190315 AA374980 AW961 1 79 M307782 AA3 15295 AA347194 AW953073 AW3681 90 AW368192 AA280772 AA251247 N85676 
AI215522 AI21638g NB7835 R12261 R67094 A1660046 AA347193 R16712 AW119006 N66906 N87768 AW900167 AI341261 AI818674 D20285 
AI475165 AA3C0756 R40626 AI122827 AA133250 A1952488 AA970372 AA889B45 AW069517 AI524385 AA19C314 AI673359 AA971 105 
AI351088AI872789AI919C56AI611216AK001472BE568761AA581004 

AI733881 AA165164 AI82S437 AI972791 AA165165 BE2ig675 AI732686 AI821671 AA260737 AW136875 AI984273 AI249271 
AW207459AL079814AA354351 AF020043AW291396 BE550484 NM_005445 BH)46917AW594249AI551554 AI631615 AW771344 
AI969758 AI699982 AA247175 AI244676 044780 AW593978 AI638479 AI373676 AW089547 AL121432 AA554698 AI016991 AI087260 
AVW49939 AF067163 W40482 AW316558 AI537184 AW381979 W401S0 AI810562 AACT3151 AI630288 AI675561 AI674420 AW840733 
AW022653 AA1 14210 AJ005015 AL046587 AA878141 AW271896 AW085287 AA150465 BE536295 AA463412 BE093222 AA213739 
AA485586 AI825913 AA706307 AI33734a R31995 AI819641 R32095 AW976653 AA742375 AA142957 AI808214 AW468303 AI205987 
AI206347AI769095BE501640AA113866 AI093931 AI752855 AA612743AA463411 AA279157AI123791 AA213570AI207305 AW627814 
AI783313 AI769315 AI743691 AI915545 AA479473 C21435 N50944 N50902 AW978102 H23837 BE08753a AA316516
122802 287993 1 AI687303AW571681 AI554465AI684252AI581056AA604098AI628160AI859843AA424021 AA460530BE042778AW273200AW273223 
AW167288 AW083347 AI654306 AW517496 AW104706 AW273214 BE139512 AW189487 AW130822 AW16r419 AI289485 AW150010 

HSSOO't AI743745 AW0B8710 

123494 21202_1 AW179019 AW179011 AF135160 NM.014050 AF078860 BE018C05 AK000285 AF151038 BE246166 AW179007 AA345114 BE619758 

BE619209W25509AA314339AA336674AA337956AW954843AW390412 N46796AA316235AA314286 R15686 BE635633 N57134 N46483 
AW368462 AA923517 AA665223 AI418513 AA837523 AI359320 AI309273 AI522278 N40939 AA904977 AA938272 N30240 AA887965 
AI571972AI028109AA094652AA883262AA887781 AI744447AW592944AI077790AW3S0883AW1 48667 N89861 AA557195 AI191824 
AI433166 AI719760 AA453089 AA630656 AA300976 AA639620 AW675033 AA284393 AW886987 AI476335 AI332939 BE301513 AA452920 
AW674302 AI925483 AW170412 AI698717AI375985 BE220635AI688151 AW514809 AW062346 AA699786 BE350848 AI660848 AI023075 
AA864876 AA1S5371 AI807947 AW514579 AI978602AI860340 AA830886 AI374788 AI283592 AA683152 AA743159 AI379932 AI432056 
AI123904AW150433 N38909 

116296 11967.2 AW149502Z43342AW0C2825 AL04g382AA442545AW971471 BE220243AW968952 AA043607AW299245AA669892AI038768 H26330 
BE463534AI628252AA836139AI277291AA489033AA741239AI209064AI300253AI275761 Z39417 C01835 



Table 9A lists about 382 genes up-regulated in ovarian cancer compared to normal ovaries. These were selected from 35403 probesets on the Affymetrix/Eos-Hu01 GeneChip
array such that the ratio of "average" ovarian cancer to "average" normal adult ovaries was greater than or equal to 10. The "average" ovarian cancer level was set to the 2nd
highest amongst various ovarian cancers. The "average" normal adult ovaries level was set to the arithmetic mean amongst various non-malignant ovaries. In order to remove
gene-specific background levels of non-specific hybridization, the 15th percentile value amongst the non-malignant tissues (see Table 7A) was subtracted from both the
numerator and the denominator before the ratio was evaluated.
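
By way of illustration only, the selection criterion described for Table 9A may be sketched in Python as follows. The sketch assumes that each probeset carries three lists of expression levels (ovarian cancers, normal adult ovaries, and non-malignant tissues). The function names, the linear-interpolation percentile rule, the reading of the background level as the 15th percentile, and the handling of a non-positive denominator are assumptions made for this example and are not taken from the specification.

```python
from statistics import mean


def percentile(values, pct):
    """Return the pct-th percentile of values by linear interpolation.

    The interpolation convention is an assumption; the text does not
    state which percentile definition was used.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(tumor, normal_ovary, non_malignant, pct=15):
    """Background-corrected ratio of 'average' tumor to 'average' normal."""
    if len(tumor) < 2:
        raise ValueError("need at least two tumor samples")
    # Gene-specific background: percentile amongst non-malignant tissues.
    background = percentile(non_malignant, pct)
    # 'Average' tumor level: the 2nd highest value amongst the cancers.
    tumor_level = sorted(tumor, reverse=True)[1]
    # 'Average' normal level: arithmetic mean amongst normal ovaries.
    normal_level = mean(normal_ovary)
    numerator = tumor_level - background
    denominator = normal_level - background
    if denominator <= 0:
        # Assumption: the ratio is treated as undefined in this case.
        return None
    return numerator / denominator


def select_up_regulated(probesets, cutoff=10.0):
    """Return (pkey, ratio) pairs with ratio >= cutoff, highest first."""
    selected = []
    for pkey, (tumor, normal_ovary, non_malignant) in probesets.items():
        ratio = tumor_to_normal_ratio(tumor, normal_ovary, non_malignant)
        if ratio is not None and ratio >= cutoff:
            selected.append((pkey, ratio))
    return sorted(selected, key=lambda item: item[1], reverse=True)
```

Taking the 2nd highest tumor value rather than the maximum keeps a single outlying sample from driving the selection, and subtracting the same background from numerator and denominator prevents a probeset with uniformly high non-specific signal from appearing unchanged merely because both levels are inflated.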

TABLE 9A: 382 UP-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL OVARY
Pkey: Primekey
Ex.Accn: Exemplar Accession
UGID: UniGene ID
Title: UniGene title
ratio: ratio tumor vs. normal tissues






Pkey    Ex.Accn        UGID       Title                                          ratio
134454  L33930         Hs.173996  CD24 antigen (small cell lung carcinoma clust  86.2
102927  X12876         Hs.65114   keratin 18                                     84.7
116909  AA436666       Hs.59761   ESTs                                           72.3
123169  AA488892       Hs.104472  ESTs; Weakly similar to Gag-Pol polyprotein [  66.8
115674  AA406542       Hs.71520   ESTs                                           65.4
102193  U20758                                                                   63.1
101839  M93036         Hs.692     membrane component; chromosome 4; surface ma   56.8
115221  AA262942                                                                 56.1
108059  AA043944                  ESTs                                           52.3
121853  AA425887       Hs.98502                                                  47.8
133504  W95070         Hs.74316   desmoplakin (DPI; DPII)                        47.0
103646  Z14244                    cytochrome c oxidase subunit VIIb              46.5
100147  D13666         Hs.136348  osteoblast specific factor 2 (fasciclin I-lik  45.5
102979  X17042         Hs.1908    proteoglycan 1; secretory granule              44.6
130967  AA134138       Hs.182579  Homo sapiens leucine aminopeptidase mRNA; com  44.5
                       Hs.82643   protein tyrosine kinase 9                      40.4
126960  AA317900       Hs.161756  ESTs                                           39.6
103111  X63187         Hs.2719    epididymis-specific; whey-acidic protein type  39.1
133829  AA453783       Hs.76550   Homo sapiens mRNA; cDNA DKFZp564B1264 (from c  39.0
111223  N68921         Hs.34806   ESTs; Weakly similar to neogenin [H.sapiens]   38.9
102803  U89916         Hs.2612S                                                  38.8
104943  AA065217       Hs.169674  ESTs                                           38.7
106606  AA457718       Hs.21103   Homo sapiens mRNA; cDNA DKFZp564B076 (from cl  38.4
120666  AA287347       Hs.238205  ESTs                                           33.1
102968  X16396         Hs.154672  methylene tetrahydrofolate dehydrogenase (NAD  36.3
104062  AA393164       Hs.97644   mammaglobin 2                                  36.0
109166  AA179845       Hs.73625   RAB6 interacting; kinesin-like (rabkinesin6)   35.9
101332  L47276                    Homo sapiens (cell line HL-6) alpha topoisome  35.0
106167  AA425906       Hs.7956    ESTs                                           34.5
101042  J05428         Hs.10319   UDP glycosyltransferase 2 family; polypeptide  34.3
125852  H09290         Hs.76550   Homo sapiens mRNA; cDNA DKFZp564B1264 (from c  33.7
101201  L22624         Hs.2256                                                   32.3
126410  R51912         Hs.12409   somatostatin                                   32.1
134326  U16306         Hs.81800   chondroitin sulfate proteoglycan 2 (versican)  32.0
126739  AA428557       Hs.92137                                                  31.6
        L20826         Hs.430     plastin 1 (I isoform)                          31.4
        R79392         Hs.23643   ESTs                                           30.9
        M21005         Hs.100000  S100 calcium-binding protein A8 (calgranulin   30.6
                                  HER3 receptor tyrosine kinase (c-erbB3; ERBB3  30.1
108860  AA133334       Hs.129911  ESTs                                           29.8
133859  U86782         Hs.178761  26S proteasome-associated pad1 homolog         29.2
107295  T34527         Hs.80120   UDP-N-acetyl-alpha-D-galactosamine:polypeptid  28.9
106210  AA423239       Hs.10338   ESTs
134711  X04011         Hs.88974   cytochrome b-245; beta polypeptide (chronic g  28.0
125769  AI382972       Hs.82128   5T4 oncofetal trophoblast glycoprotein         27.5
107222  D51235         Hs.82689   tumor rejection antigen (gp96) 1               27.4
102260  U28386         Hs.169657  karyopherin alpha 2 (RAG cohort 1; importin a  26.9
134691  M59979         Hs.88474   prostaglandin-endoperoxide synthase 1 (prosta  26.8
105588  AA279215       Hs.10867   ESTs                                           26.3
130718  N70195         Hs.18376   ESTs                                           26.3
        N67551         Hs.12844   EGF-like-domain; multiple 6                    25.6
131965  W90146         Hs.35962   ESTs                                           25.6
132903  AA235404       Hs.5985    Homo sapiens clone 25186 mRNA sequence         25.6
114359  Z41589         Hs.153483  ESTs; Moderately similar to H1 chloride chann  25.5
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101185 


L19872 


Hs.170087 


aryl hydrocarbon receptor 


25.2 


128742 


D0C763 


Hs.251531 


proteasome (proscnne; macropan) subunit; alph 


25.1 


116724 


F13665 


HS.6S641 


ESTs 


24.9 


111929 


R40057 


Hs.112360 


prominin (mouse)-like 1 


24.9 


102915 


X07820 


Hs.2258 


matrix metalloproleinase 10 (sfromelysin 2) 


24.8 


131210 


AA430047 


Hs.24248 


ESTs 


24.7 


101714 






Human phosphatidylcholine 2-acylhydroiase (oP 


24.6 


100154 


D14657 


Hs.81892 


KIAA0101 gene product 


24.6 


134656 


X14787 


Hs.87409 


thrombospondin 1 


24.3 


100294 


D49396 


Hs.75454 


anUoxidanI protein 1 


23.9 


104080 


M402971 


Hs.57771 


lallikrein 11 


23.7 


107055 


M600310 


Hs.18720 


programmed cell death 8 (apoptosis-inducing f 


23.7 




AA411502 




ESTs; We^y simile to airway trypsin-lllte p 


23.7 


130350 


U02020 


Hs!239138 


pre-B-cell cotany-enhanciig feclor 


23.7 


105870 


AA399623 


Hs.23505 


ESTs 


23.6 


118528 


N67889 


Hs.49397 


ESTs 


23.4 


105309 




Hs.4104 


ESTs; Weakly similar to cDNA EST yk386g7.5 CO 


23.2 


109680 


F09255 


Hs,4993 


ESTs 


23.2 


131501 


AA121127 


Hs.181307 


H3 histone; family 3A 


23.2 


100824 


HG4058-HT4328 


Oncogene Aml1-Evi-1, Fusion Activated 


23.1 


111890 






ESTs 


23.0 


101543 






pentaxin-related gene; rapidly induced by IL- 


22.8 


102095 






sterol carrier protein 2 




11498B 






ESTs: Weakly similar to phosducin; retinal [H 


22.8 


120695 






ESTs 


22.8 


130941 


049394 


Hs.2142 




22.8 


106654 


AA460449 


Hs.3784 


ESTs; Highly similar la phosphoserine aminotr 


22.7 


109141 


M176428 


Hs.193380 


ESTs 


22.6 


102345 


U37283 


Hs.58882 


Mkirofbril-associaled glycopFateln-2 


22.6 


115652 


M405098 


Hs.38178 


ESTs 


22.4 


100103 


AF007875 


Hs.5085 




22J 


105463 


AA253370 


Hs.32646 


ESTs 


22.2 


132624 


AA164819 


Hs.53531 


ESTs 


22.2 


119743 


W70242 


Hs.53086 


ESTs 


22.0 


132528 


AA283006 


Hs.50758 


chromasome-asscdated polypeptide C 


22.0 


107174 


AA621714 


Hs.25338 


ESTs 


.21.8 


134495 


D63477 


Hs.84087 


K1AAC143 protein 


21.8 


131985 


AA434329 


Hs.36563 


ESTs 


21.5 


105832 


AA398346 


Hs.21898 


ESTs 


21.2 


126160 


N90960 


Hs.247277 


ESTs; Weakly similar to Iransfomrallon-relate 


21,2 


114846 




Hs.44343 


ESTs 


20.9 


109703 


F09684 


Hs.24792 


ESTs; Weakly similar to ORF YOR283w [S.cerevl 


20.9 


135154 


Ml 26433 


Hs.1 73242 


sorting nexin 4 


20.B 




M25753 


Hs.23950 


cyclitiBI 


20.7 


105616 


AA280670 


Hs.24968 




20.5 


131148 


C00038 


Hs.23579 


ESTs 


20.2 


129337 


R63542 




KIAA0990 protein 


20.2 


133640 


D83004 


HsJ6355 


ubiquitln-conjugating enzyme E2N (homologous 


20.1 


127479 


AA513722 


Hs.179729 


collagen; type X; alpha 1 (Schmid metaphyseal 


19.9 


133711 


J04130 




smal Inducible cytokine A4 (homologous to mo 


19.B 


131818 


Z39297 


HS.32B1 


neuronal pentraxln II 


19.7 


125303 


Z39B21 


Hs.107295 


ESTs 


19.6 


109112 


AA159379 


Hs.72865 


ESTs 


19.5 


105376 


AA236569 


HS.876B 


ESTs; Weakly similar to lilt ALU SUBFAMILY SQ 


19.2 


103605 


Z35402 


Hs.194657 


cadherin1:E-cadherln (epithelial) 


19.1 


100661 


HG2874-HT3018 


Rlbosomal Protein L39 tHcmolog 


19.1 


129571 


X51630 


Hs,1145 


Wilrre tumor 1 


19.0 


115239 


AA273650 


Hs.73291 


ESTs; Weakly similar to similar to the beta t 


18.9 


131662 


U90551 


Hs.28777 


H2A histone family; member L 


18.9 


131272 


AA423834 


Hs.139033 


paternally expressed gene 3 


13.9 


130343 


AA490252 


Hs.15485 


ESTs; Weakly similar to APICAL-LIKE PROTEIN I


18.8


103246 


X75548 


Hs.28988 


glutaredoxin (thioltransferase)


18.7


101809 


M86849 




Homo sapiens connexin 26 (GJB2) mRNA, complet 


18.6


105344 


AA235303 




ESTs 


18.4


135225 


AA45598B 


Hs.9667


butyrobetaine (gamma); 2-oxoglutarate dioxyge


18.4 


115786 


H25836 


Hs.83429 


tumor necrosis factor (ligand) superfamily; m


18.3 


131510 




Hs.27842 


ESTs; Weakly similar to similar to 1-acyl-gly


18.2 


124059 


F13673 


Hs.99769 


ESTs 


18.0 


103352 


X89398 


Hs.78853 


uracil-DNA glycosylase


17.9 


132742 


AA490862 


Hs.55901 


ESTs; Weakly similar to C43H8.1 [C.elegans]


17.9 


135242 




Hs.9700 


cyclin E1


17.9 


123494 


AA599786 


Hs.112110 


ESTs 


17.8 




T90621 


H5.1090S2 


chromosome 14 open reading frame 2 


17.7 


12B517 


AA280617 


Hs.100861 


ESTs; Weakly similar to p60 katanin [H.sapien


17.6 


130150 


Z39228 


Hs.151344 


UDP-Gal:betaGlcNAc beta 1;3-galactosyltransfe




103448 


X99133 


Hs.204238 


lipocalin 2 (oncogene 24p3) 


17.5 


119708 


W57810 


HS.S7904 


mago-nashi (Drosophila) homolog; proliferatio


17.5 


122946 


AA477445 


HS.10S341 


ESTs 


17.5 


125819 


AA044840 


Hs.251871 


CTP synthase 


17.5 


131689 


AA599653 


Hs.30696 


transcription factor-like 5 (basic helix-loop


17.5 


115061 


AA253217 


Hs.41271 


ESTs 


17.3 


113702 


T97307 


Hs.161720 


ESTs; Moderately similar to !!!! ALU SUBFAMIL


17.3 


115291 


AA279943 


Hs.122579 


ESTs 


17.3 


1C2S67 


U59863 


Hs.146847 


TRAF family member-associated NFKB activator


17.2 
129229 AA211941

129351 M167268 

110769 N22222 

113182 T55234 

115892 AA436946 

123114 AA486407 

123442 AA598803 

123339 AA604263 

123689 AA609666 

131941 D62657 

120649 AA2B7116 

102139 U15932 

115522 AA331393 

135243 AA215333 

131257 AA256042 

109508 AA233892 

132701 AA279359 

134449 L34155 

126180 R18070 

106124 AA423987 

116363 AA282071 

117688 N34895 

131245 AA620599 

101674 M61916 

126819 AA30SS36 

134039 878569 

130648 AAD75427 

102823 U90914 

128470 M447504 

115844 AA430124 

132543 AA417152 

130155 L33404 

101008 J04162 

120472 AA251875 

116844 H64938 

106753 AA476944 

114767 AA148885 

127370 AI024352 

101507 M27492 

102519 U52969 

102610 U65011 

111244 N69556 

120404 AA234921 

130455 X17059 

129519 AA298786 

106553 AA454967 

109502 AA233837 

115967 AA446887 

104636 AA004415 

134133 X93920 

134444 X04470 

132998 Y00062 

131997 D32399 

134056 R27353 

101249 L33881 

105298 AA233459 

107119 AA620307 

115839 AA429038 

122802 AA460530 

129896 AA043021 

130269 AA284694 

134374 D62533 

105370 AA443841 

130919 AA291710 

132923 U21858 

107968 AA034020 

125390 H95094 

107148 AA621131 

110788 N24730 

109481 AA233342 

105645 AA282147 

106030 AA4122S1 

132618 AA2S3330 
133230 S82240 
124803 R46480 

121381 AA405747 
105200 AA195399 
105627 AA281245 
114986 AA251010 
118036 N52844 
134672 N79749 
110915 N46252 
■ 117984 N51919 



polyadenylate binding protein-interacting pro
Human ras inhibitor mRNA; 3' end
yw34b06£l Morton Fetal Cochlea Homo sapiens 
Human DNA sequence from clone 30M3 on chromos
ESTs 

ESTs; Moderately similar to KIAA0454 protein 



ubiquitin-specific protease 1

dual specificity phosphatase 5

putative G protein-coupled receptor
ESTs

ESTs; Weakly similar to !!!! ALU SUBFAMILY SX

BCL2-associated athanogene 2

laminin; alpha 3 (nicein (150kD); kalinin (16

ubiquinol-cytochrome c reductase; Rieske iron

ESTs 

activator of S phase kinase 
ESTs 

DKFZF^4E1962 protein 



protein regulator of cytokinesis 1
kallikrein 7 (chymotryptic; stratum corneum)
Fc fragment of IgG; low affinity IIIa; recept
ESTs; Weakly similar to Gag-Pol polyprotein I 



MFH-amplified sequences with leucine-rich tan
KIAA1013 protein 

N-acetyltransferase 1 (arylamine N-acetyltran



ESTs 



secretory leukocyte protease inhibitor (anti
protein tyrosine phosphatase; receptor type;
Homo sapiens clone 23714 mRNA sequence
ESTs; Weakly similar to Pelle associated prot
protein kinase C; iota 
ESTs 



sprouty (Drosophila) homolog 2

collagen; type IV; alpha 3 (Goodpasture antig

TATA box binding protein (TBP)-associated fac



ESTs; Weakly similar to WD40 protein Ciao 1 [



ESTs; Weakly similar to WASP-family protein [



ESTs; Weakly similar to cDNA EST EMBL:T00542
132550 AA029597

124315 H94892 

102547 U57911 

125134 W19228 

111806 R33468 

106983 AA521195 

106498 AA452141 

110787 M24716 

122860 AA464414 

131535 AA504642 

116188 AA464728 

107243 D59489 

129300 C20976 

134487 FSB185 

102348 U37519 

131839 H80622 

119620 W47620 

120802 AA343533 

102250 U28014 

105539 AA258873 

114965 AA250737 

118001 N52151 

100448 D87469 

130920 D60975 

131075 Y0(I757 

105496 AA256323 

109235 AA193S92 

118215 N62195 

134388 M1S841 

106897 AA4ag790 

133050 S67325 

109683 F0930B 

121453 AA411745 

102876 X03663 

101804 M86699 



107080 AA609210 

110799 N26101 

112253 R61818 

116760 H11C64 

120314 AA194166 

123005 M479726 

132572 AA448297 

110561 H59617 

101923 S7S2S6 

134992 H0562S 

105516 AA257971 

105248 AA226968 

109130 AA172040 

115955 AA446121 

116135 AA460314 

116284 AA487252 

132384 AA479933 

134753 Y09216 

126136 W31479 

133928 N34096 

117396 N26330 

127CC7 AA299360 

130567 L07493 

135C73 AA452000 

11514C AA258030 

115636 AA347193 

13324C D31161 

107674 AA011027 

114149 Z38814 

132478 H20906 

104252 AF002246 

102436 U45499 

100116 D00654 

110970 N51374 

130417 U58522 

132906 AA142857 

107853 AA024427 

103467 Y00451 

100438 067448 

102654 U68494 

103172 X68742 

106856 AA486183 

108255 AA063157 



Hs.170195 bone morphogenetic protein 7 (osteogenic prot
Hs.6906 v-ral simian leukemia viral oncogene homolog
Hs.46638 chromosome 11 open reading frame 8
Hs.100748 ESTs 
i4s.246S1 ESTs 

Hs.10887 similar to lysosome-associated membrane glyco
Hs.7171 ESTs 

Hs.12244 ESTs; Weakly similar to C44B9.1 [C.elegans]
Hs.112159 ESTs

Hs.28436 ESTs; Weakly similar to coded for by C.elega
Hs.184598 ESTs
Hs.34727 ESTs

Hs.110165 ESTs; Highly similar to ribosomal protein L26
Hs.83954 Homo sapiens unknown mRNA
Hs.87539 aldehyde dehydrogenase 8
KIAA0633 protein
2'-5'oligoadenylate synthetase 3
ESTs; Weakly similar to predicted using Genef
caspase 4; apoptosis-related cysteine proteas
ESTs 



Hs.33010 
Hs.56009 
Hs.128777 
Hs,74122 
Hs,25242 
Hs.72472 
Hs.47447 
Hs.57662 
Hs.75525 
HS.226S 
Hs.25264 
Hs.42300 
Hs.77910 
Hs.82575 
Hs.167496 
Hs.63788 
Hs.27607 
Hs.239681 
Hs,174142 
Hs.169840 
Hs.107968 
Hs.20814 
Hs.4029 
Hs.23044 
Hs.19221 
Hs.7e38 
Hs.1 04222 
Hs.156342 
Hs.221040 
Hs.105577 
Hs.237825 
Hs.5199 

Hs.92414 
Hs.21214 
Hs.22826 
Hs.20161 
Hs.44198 
Hs.94179 
Hs.237809 
Hs.46967 
H6.173135 
Hs.129051 
Hs.7766 



EGF-like-domain; multiple 2



111 



DKFZP434N126 protein
ESTs; Weakly similar to !!!! ALU SUBFAMILY SQ
3-hydroxy-3-methylglutaryl-Coenzyme A synthas
small nuclear ribonucleoprotein polypeptide B
RAN binding protein 6

propionyl Coenzyme A carboxylase; beta polype
ESTs

ESTs; Weakly similar to KIAA0554 protein [H.s
colony stimulating factor 1 receptor formerl
TTK protein kinase 
ESTs 

ESTs; Highly similar to CGI-27 protein [H.sap

glioma-amplified sequence-41

RAD51 (S. cerevisiae) homolog (E coli RecA ho



signal recognition particle 72kD 

ESTs; Weakly similar to UBIQUITIN-CONJUGATING 

HNL=neutrophil lipocalin [human, ovarian canc



ESTs: Weakly similar to hypothetical protein 
Human DNA sequence from clone 167A19 on chrom
dual-specificity tyrosine-(Y)-phosphorylation



EST11857 Uterus tumor I Homo sapiens cDNA 5'
replication protein A3 (14kD)
Homo sapiens mRNA; cDNA DKFZp586E1624 (from c 
ESTs; Weakly similar to supported by GENSCAN 



Hs.68613 

Hs.1 4732 

Hs.41143 

Hs.27196 

Hs.49500 

Hs.210853 

Hs.790 

Hs.7141 

Hs.77443 

Hs.96870 

Hs.155485 

Hs.234896 

Hs.59461 

Hs.78712 

Hs.91417 

85.24385 

Hs.116774 

Hs.15839 

Hs.172608 



Homo sapiens mRNA full length insert cDNA do 



topoisomerase (DNA) II binding protein
Human hbc647 mRNA sequence
integrin; alpha 1

ESTs; Weakly similar to similar to oxyster
124308 H93576

129057 X62466 

128845 AA455658 

129025 AA420992 

107638 AA009528 

134480 AA024664 

115262 AA279112 

102580 ueceos 

1Q6614 AA458934 

107115 AA610108 

115764 AA421552 

121770 AA421714 

132191 AA449431 




Hs.162981 
Hs.179912 
Hs.27693 
HS.91C11 
Hs.11469 
Hs.158688 
Hs.250911 
Hs,77542 



ESTs; Highly similar to CGI-124 protein [H.sa
anterior gradient 2 (Xenopus laevis) homolog
KIAA0896 protein 
KIAA0741 gene product 



101973 S82597 

102669 U71207 

104147 AA451992 

105474 AA450212 

115881 AA435577 

129950 M31516 

132783 N74897 

133784 AA214305 Hs.76173 ESTs 

134248 AA292677 Hs.80624 ESTs 

105565 AA278302 Hs.18349 

127999 AA837495 Hs.69851 

108040 AA041551 



Hs.29279 
Hs.226799 
Hs,42484 
Hs.184942 
Hs,1369 



111345 N89820 

11SSB3 AA398913 

128965 T17440 

101396 M15796 

132154 U84573 

101275 L37935 



116984 AA447687 

105538 AA258e60 

102200 U21661 

116710 F10577 

119780 W72967 

112996 T23S39 

103029 X54489 

101256 L34600 

107032 AA599472 

126617 AI287461 

131475 Z39063 



102437 U46569 

104301 D45332 

127236 AI341818 

101465 M22612 

113806 W42957 

133536 Y00264 

109799 F10770 

113523 T90037 

116195 AA465148 

134542 X57025 

125298 Z39256 

119367 T78324 

134470 X54942 

134288 AA430008 

105127 AA158132 

110627 H70485 

115188 AA251819 

132632 N59764 

124049 F10523 

100079 AB00236S 

113987 W87494 

117280 N22107 



UDP-N-acetyl-alpha-D-galactosamine:polypeptid
absent (Drosophila) homolog 2
. Highly similar to HSPC039 protein [H.sa
Homo sapiens mRNA; cDNA DKFZp564Ca53 (from cl
G protein-coupled receptor 64
decay accelerating factor for complement (CD5
DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide



Hs.48644 
Hs.8768 
HS.54SS8 
HS.14SS9 
Hs.45231 
Hs.107418 
Hs.78996 
Hs.41270 
Hs.3273 
Hs. 14846 
Hs.69499 
Hs.22929 
Hs.7859 

Hs!21766 
Hs.91109 
Hs.32597 
Hs.157205 
Hs.70312 



proliferating cell nuclear antigen



nelongaior 
ns mRNA; d 



cDNADKFZpS64[X16(fronicl 



fibrinogen; gamma polypeptide 



succinate-CoA ligase; GDP-forming; beta subun



Human somatic cytochrome c (HCS) gene; comple 
aquaporin 5

budding uninhibited by benzimidazoles 1 (yeas



Hs.247309 

Hs.164950 

Hs.27263 

Hs.38516 

Hs.169248 

Hs.221986 

Hs.6783 

Hs.93658 

HS.24139S 

Hs.250617 

Hs.177486 

Hs.1 80378 



Hs.83758 CDC28 protein kinase 2 



Hs.88367 ESTs 



>e; polypeptide 2A(58kD) 
KIAA0367 protein

ESTs; Moderately similar to COMPLEMENT C1Q SU



TABLE 9B

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



BE623001 L0S096 AA383604 AW96641 6 1453295 AA46021 3 AWS71S19 AA6036SS 

AA806983 AA625873 W78031 BE206307 AA550803 AI743147 AI990075 AA948274 Ml 29533 AI635399 AA505313 AI624669 AW594319
AI221834 AI337434 AA307706 BE550282 AI760467 AI630636 AI221521 AW674314 AW078889 AI933732 AI586969 AM 86928 AW074696 
AI127486 AL079644 A1910815 H17814 AA310903 AW137864 T19279 AA026582 AA306036 AW383390 AW383389 AW383422 AW3B3427 
AW383395 H09977 AA306247 AA352501 AW403639 F05421 AA224473 AA305321 H93904 AA089612AW391543 AW402915 AW173382 
AW402701 AW403113 R94438 N73126 H93455 AA09C928 AA095051 T29025 AW951071 L47277 L47276 AI375913 BE384166 W24652 
AA746288 AA5S8223BE090591 H93033 N57027 AA504348 AA327653 AW959913 N53767 AA843715 AI453437 AW263710AI076594 
AA583483 AW873194 AW575166 AI128799 AIB03319 AL042776 AW074313 A1887722 A10322B4 AA447521 AI123885 N29334 AI35491 1 
AW090687 AA236763 AA435635 AA236910 AA047124 AA236734 AW514610 H93467 AA962007 AI446783 AA127269 AI613495 
AI686720 AI587374 AA936731 AA702453 AI859757 AA216786 AI251B19 AI469227 AA806022 AI092324 N71868 AA968782 AA236919 
AA809460 AA227220 AA765284 All 92007 AA768810 AA805794 AA7292B0 AA80623e AW768817 N71879 AI050686 AA505822 
AA668974 AI68ai60 BE045915 AW466315 AA731314 AA649568 AA834316 AW591901 AW063876 AW294770 AI300266 A1336094 
AI560380 AA721755 H09978 D20305 D29155 AWe21790 BE150864 F01675 AI457474 AW466316 AA550969 AA630788 
A1393237 AI52131 7 AI761348 AF025841 D43968 AW994987 L34598 AF026841 D89789 D89788 D89790 AW998932 AI971742 AI310238 
X90976 AW13966a AW674280 AI365552 AA8774S2 AV657554 C75229 AA376077 AI79805S AW609213 W25536 H30149 BE075089 
BE075190 AW580858 H99598 AA425238AA133916 AW363478 BE158121 BE158127AW467960 BE158135 BE158125 BE158146 
N92860 AA847246 A1961688 AI361423 AA878154 AA043767 AIB63712 AIS69226 AW339007 AI371266 AI368901 AA046624 AA134739 
AW449154AA130232AI458720AA962511AI70C627 R70437AW00400eAA045229Al671572 H99S99AA043768AI685454Al871685 
N29937 X90977 AA524240 A11421 1 4 A1825750 A1567805 AI631365 A1347893 AA134740 F20669 AA046707 AW793216 AW963298 
AW9593B0AA36326SAI784Sg3AI2SB201 R694S1 AV6S7618 AI6gSS88 

M68874 AL022147 M7a93 AL049797 BE439441 T27650 A1766240 AW1 50345 AW778943 A1627464 BE439479 AA587049 A12779C0 
A1984983AI630935 

M86849 AA316280 NliL004004 AA316269 BE142653 AA461400 AW802042 BE162893 AW383165 AA490688 AW1 17930 AW384563 
AW384S44AW384S66AW378307AW378323AVV83g08SAA257102AW378317AVV276060AW27124SAW3782g8AVV3844g7AIS9B114 
AW264S44 AI018136 AW021810 AA961504 AWD86214 AW771489 AW192483 AI2g0266 AWig2488 AW384490 AW007451 AW890895 
AA554460 AA613715AW020066A1783695 AI58949BA1917637AW264471 AW384491 AI816732AW368530AW368521 AW368463 
AA461087 AI341438 AI970613 A1040737 AI4184aO AA947181 AAg62716 AI280S9S AW769275 AW023S91 AI160977 AA05S400 N71882 
. AA490466 AW243772 AW31 6636 AI076554 AWS1 1702 M6g323 H88gi2 AA2S701 7 AI952506 H88gi3 Aigi2481 AA60071 4 BE465701 
N641 49 C00523 N64240 AA677120 

X99133 XB3006 W38398 AA401137AA298242AA366738 AA30B125AW5B3781 AA298668AW845024 BE140204 AW845005 U47734 
AA837S7S NM_005S64 AA329732 AA421943 BE171S67 S7S2S6 AI7S0047 AI762213 AA10073S AW612g93 AI474120 AW062884 
AI940001 AVVD628S2AW062899 BE182639AW77887SAAS26093 AWS17424AI939989 AA076ie6 BE182636AA169S69 AA167439 
AI283967 AA167783 AA076140 AI74964g AA168792 AI708618 AA400973 AA514773 AA514789 AA1644S8 AA167440 AA074845 
AA421944 AA514874 AA079SS7 AA102361 AAS87027 AA642930 AA87B029 AA1644Sg AW176400 AW47S086 AA8S7S22 AA148193 
AA838234 AA593897 AI284506 AW193324 AA148194 AW583341 AI669077 AW264913 AA074902 AI680515 AA169874 AA16961 4 
AA079651 AWS91737 AWig0644AA076S6SAA662747AA07S8g6AA535642 N27757AI306666 AA074727 N79823AAS24360 AI826B00 
AA173827 BE140374 BE004062 AW26S060 BE184103 Aligg258 AA8578S3 AA299459 AA837890 AI626104 AA503624 BE183618 
BE1 83717 AA573267A1833071 AW270590 AAS06601 BE004fl10 AA837854 A16758g5 A18104gi A1184883 AW664712 AA076046 
AA51 5574 AW352267 AI7g7418 AA172395 AI749194 AI559933 AAS02697 AA321220 AI866124 AI695633 AA494293 AW085635 
AA1 65649 AA16S663 

AB037771 BE0O5O79 AA394189 AW969650 AA299360 AA39B081 W37627 AW750817 AW630138 AI522068 BE326323 AA374890 
AW418534 AW997510 AW995214 AW959649 AA504426 D79223 D79621 AI276062 A19731SS AA653470 AA337887 AI382521 AW0B4427 
D57078 W37628 AI610506 Z30230 AI567034 AA766091 H25097 H25078 AW9gi5(J7 AA319736 

BE000831 AA641787AW17303eAA327931 AW117510AW664665A1066624 A1478955A1863075AI073744AA490170 R46651 AI076653 



Table 10A lists about 733 genes up-regulated in ovarian cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos-Hu03
GeneChip array such that the ratio of 'average' ovarian cancer to 'average' normal adult tissues was greater than or equal to 3.0. The 'average' ovarian cancer level was set to
about the 80th percentile amongst various ovarian cancers. The 'average' normal adult tissue level was set to the 90th percentile value amongst various non-malignant
tissues. In order to remove gene-specific background levels of nonspecific hybridization, the 15th percentile value amongst the non-malignant tissues was subtracted from both
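
The selection rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used to build Table 10A: the function names, the linear-interpolation percentile, and the `floor` guard against a zero or negative denominator are all hypothetical, and reading "both" as the tumor level and the normal level is an assumption, because the sentence is cut off in the source.

```python
from typing import Dict, List, Sequence, Tuple


def percentile(values: Sequence[float], pct: float) -> float:
    """Return the pct-th percentile (0-100), linearly interpolated."""
    if not values:
        raise ValueError("percentile() needs at least one value")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(
    tumor: Sequence[float], normal: Sequence[float], floor: float = 1.0
) -> float:
    """Ratio of 'average' tumor to 'average' normal expression.

    'Average' tumor  = 80th percentile of the ovarian cancer samples.
    'Average' normal = 90th percentile of the non-malignant samples.
    The 15th percentile of the non-malignant samples is subtracted from
    both levels as a background estimate (assumed reading of the text).
    `floor` is a hypothetical guard, not taken from the source.
    """
    background = percentile(normal, 15)
    tumor_level = percentile(tumor, 80) - background
    normal_level = percentile(normal, 90) - background
    return max(tumor_level, floor) / max(normal_level, floor)


def select_upregulated(
    probesets: Dict[str, Tuple[Sequence[float], Sequence[float]]],
    threshold: float = 3.0,
) -> List[str]:
    """Return probeset keys with ratio >= threshold, highest ratio first."""
    ratios = {
        pkey: tumor_to_normal_ratio(tumor, normal)
        for pkey, (tumor, normal) in probesets.items()
    }
    kept = [pkey for pkey, ratio in ratios.items() if ratio >= threshold]
    return sorted(kept, key=lambda pkey: ratios[pkey], reverse=True)


if __name__ == "__main__":
    # Made-up intensities, for illustration only.
    normal = [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]
    tumor = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110]
    print(select_upregulated({"probe_a": (tumor, normal), "probe_b": (normal, normal)}))
```

Using a high percentile for the normal tissues makes the filter conservative: a probeset passes only if the tumor level exceeds even the strongest-expressing normal tissues by the stated factor.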



TABLE 10A: ABOUT 733 UP-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES
Pkey: Primekey

UGID: UniGene ID
Title: UniGene title



ExAccn UGID Title

T27013 Hs.3132 steroidogenic acute regulatory protein

X51630 Hs.1145 Wilms tumor 1

AA260737 Hs.72472 BMPR-lb; bonemc 



Hs.30743 Preferentially expressed antigen in melanoma

415511 AI732617 Hs.182362 ESTs 

422966 BE646072 Hs.122679 ESTs 

410929 H47233 Hs.30643 ESTs 

400289 X07820 Hs.2258 Matrix Metalloproteinase 10 (Stromelysin 2)

449034 AI624049 Hs.277523 gb:ts41a09.x1 NCI_CGAP_Ut1 Homo sapiens cDNA

427585 D31152 Hs.179729 collagen; type X; alpha 1 (Schmid metaphyseal

428392 H10233 Hs.2265 secretory granule, neuroendocrine protein 1 

448243 AW369771 Hs.77496 ESTs 

430691 C14187 Hs.103538 ESTs 

444783 AK001468 Hs.62180 ESTs 

407638 AJ404672 Hs.288693 EST 

423739 AA3981SS Hs.9760O ESTs 

436982 AB018305 Hs.5378 spoi 

451110 AI955040 Hs.301584 ESTs 

426427 M86699 Hs.169840 TTK protein kinase
424717
443646 
424345 



G protein-coupled receptor 64 
RU2S 

hypothetical protein FLJ22252 similar to SRY-
guanylate cyclase 1, soluble, beta 3
MAD2 (mitotic arrest deficient, yeast, homolo
NIMA (never in mitosis gene a)-related kin
Homo sapiens mRNA for KiM1661 pn ' ' 



wingless-type MMTV integration site family, m
ESTs

Homo sapiens cDNA FLJ10518 fis, clone NT2RP20
ras homolog gene family, member I
solute carrier family 7, member 11
cell division cycle 2, G1 to S and G2 to M
ESTs.Mc ■ ■ ■' 



AA279490 Hs.86368 
AA634543 Hs.79440 
AF157326 H5.184786 



AK00201S 
BE379761 
AL157581 
AW027457 



Hs.163443 
HS.14365S 
Hs,5887 
Hs.14248 
Hs.30357 
Hs.30323 
Hs.226765 



RNA binding motif protein 7

ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAM

Homo sapiens mRNA; cDNA DKFZp434E0626 (from c 



414972 BE263782 Hs.77695 KIAA0008 gene product 

435039 AW043921 Hs.1 30526 ESTs 

447033 AI357412 Hs.157601 EST - not in UniGene

433764 AW753676 Hs.39982 ESTs 

442611 BE077155 Hs.177537 ESTs 

408562 AI436323 

427344 NM.000869 
421478 
426635 
415989 
433159 



8E394412 Hs.61252 



418506 AA084248 



Hs.49136 
Hs,171077 
HS.2S04O 
HS.3C1430 
HS.26S398 
AB0Z7249 Hs,104741 
AI63S931 Hs.147613 
NM_004482 Hs.278611 
NM.005754 Hs,220689 



kinesin-like protein 2
ESTs 

G protein-coupled receptor 39 



447700 A1420183 

450480 X82125 

425176 AWD15644 

435496 AW840171 



ESTs, Weakly similar to KIAA0357 [H.sapiens]



445413 
425734 
421451 
410044 
427878 
408460 
422972 
443715 
440901 



Hs.213g8 
Hs.73625 
Hs.269347 
Hs.72402 
Hs.12677 



AA291377 Hs.50831 

BE566742 Hs.58169 

C05766 Hs.181022 

AA054725 Hs.285574 

N59319 Hs.145404 

AI583187 Hs.9700 

AA909358 Hs.1 28612 

AI263307 Hs.146228 

R64730.comp Hs.155986 

AW163518 Hs.1554e5 

AW162948 Hs.64542 

AI949952 Hs.49397 

AF063306 Hs,36708 

AI864131 Hs.71119 

AW243307 Hs.170187 



ESTs; Highly similar to SPERM SURFACE PROTEIN 



pre-mRNA cleavage factor Im (68kD)
449433
453878 
460606 
407001 
414316 
426492 
435181 



AW291168 

AJ245671 
AA576953 
AW235613 
AL137163 

AA383092 

AK001050 

AW206878 

AI672096 

AW964440 

NM_004672 

U12471 

Z24878 

AL021918 

AA669339 

AI6834a7 

AW149266 

AI470523 



409143 
407771 
419088 
431725 
431750 
436635 
441826 
417728 



433330 
439759 
427660 



450221 
439262 
435420 



Hs.41295 
Hs.117183 
Hs.12844 
Hs.22972 
HS.13302Q 
Hs.57649 
Hs.1608 



AL138272 

AI638323 

X65724 

AA514986 

AF220050 

AW503603 

AW1 38437 

AA862985 

NM_003478 

AA161520 

AF002246 

AW975942 

AA263143 

AW898292 

AK001581 

AV649989 

AA037145 

AW207084 

AL359055 

AI741320 

AI868872 

AA326108 

D16532 

AFIOOTBI 

AA328102 

AA832333 

AI928513 

AA988176 

AI301740 

AA1 22393 

BE313625 

M13609 

AL045633 

W67883 



Hs.299112 
Hs.26130 
Hs,1 82356 
Hs,50081 
Hs.301670 
Hs.1 54140 
HS.98C74 
HS.4976S 
Hs.102267 
Hs.27693 
(^.61635 
Hs.172608 
HS.1346S0 
Hs.138966 
Hs.62713 

Hs!2839 

Hs.283705 

Hs.181385 

Hs.129915 

Hs.24790 

Hs.89232 

Hs.279526 
Hs.210863 
Hs.48524 



Hs.1 72865 
Hs.132816 
Hs.67709 
Hs.1 14121 
Hs.288966 
HS.S3631 
Hs.73729 
Hs.1 94678 
Hs.24641 
Hs.1 24399 
Hs.59203 
Hs.121553 
Hs.1 73381 
Hs.70811 



gb:yg4a01.s1 Soae 
EGF-like-domain; multiple 6
Homo sapiens cDNA FLJ13352 fis, clone OVARC10



ins cDNA FU13103 Rs, clone NT2RP30 



ESTs 



Hs.247964 Human thrombospondin-1 gene, partial cds

gb:HSB65D052 STRATAGENE Human skeletal muscle
Hs.168174 zinc finger protein 184 (Kruppel-like)
Hs.28838 KIAA1571 protein

icDNAFU11441 fis, clone HEMBA10 



ESTs 



Norrie disease (pseudoglioma)
ESTs

phosphotriesterase related
KIAA1573 protein

chromobox homolog 5 (Drosophila HP1 alpha)



ESTs 

RAD51-interacting protein

Homo sapiens mRNA; cDNA DKFZp564H1663 (from c

polymerase (DNA directed), gamma
Human hbc647 mRNA sequence
cleavage stimulation factor, 3' pre-RNA, subu 
ESTs 



very low density lipoprotein receptor 
WNT1 inducible signaling pathway protein 3
cytoskeleton associated protein 2 



hypothetical protein FLJ20641
dihydropyrimidinase-like 2
hypotheUcal protein FUaOSIS 



AI733859 
AF212848 
AF026941 
AW207533 
AW131B88 
AI819063 
AI754593 



Hs.209122 ESTs 
ls.145968 ESTs 



ESTs, Weakly similar te hypothetcal protein 



Hs.111471 

AA300576 Hs.85769 

AW6638SB HS.S6120 

NIUL015368 Hs.30985 

Nl\/L003462 HS.3384S 

AW411194 Hs.120051 

AW819642 Hsi4135 

AW368397 Hs.150042 



acidic 82 kDa protein mRNA 



llglit intermediate polypept 
transporter similar to yeast MRS2
KIAA0007 protein 

- lA; c(»IA DKFZp564B222 (fron 




CGI-94 protein

fucose-1-phosphate guanylyltransferase
nuclear cap binding protein subunit 2, 20kD
. Homo sapiens CDNAFU119BQ re, clone HEMI 
442914
417995 

418946 



447020 
458027 
425217 



431548 
412719 
411945 
424078 
433558 
434265 
453911 
415539 
442717 
432358 
409731 
419699 
420313 
422506 
425733 
434160 



436754 
437212 
447312 
409732 



424539 
413677 
406076 
420179 
450375 



425420 
428664 
419131 



417168 
429486 
442917 



AB001106 Hs.151413 

W07318 \\s2A0 

AI536065 Hs.122460 

AA101043 Hs.151254 

AW188551 Hs.99519 

AW974175 Hs.188751 

AI798841 Hs.132103 

AA743276 Hs.301052 

U79734 Hs.97206 

AA371612 Hs.115351 

AA310693 Hs.279512 

T27308 HS.169B6 

L49054 Hs.85195 

AU075596 Hs.155174 

NM 001809 Hs.1594 

AA1 55950 Hs.18645 

AA805166 Hs.165165 

NM 000401 H3.75334 

Y19"062 Hs.96870 

A1308862 Hs.167028 

AB037734 Hs.4993 

BE04432S Hs.227280 

NM_(W1851 HS.1S4890 

AA3822S7 Hs.106S3 

R13949 Hs.226440 

AA814309 Hs.123583 

AA948033 Hs.130853 

NM_006153 Hs.54589 

AA121514 Hs.70832 

AW582962 Hs.300961 

AI834273 Hs.9711 

AW016610 H3.129911 

AL033527 Hs.92ia7 

AB006625 Hs,139033 

AA833757 Hs.201769 

AA846811 Hs.130554 

AW503867 Hs.4007 

AI733e81 Hs.72472 

R88362 Hs.1 80591 

AI093491 Hs.72830 

AA125985 Hs,56145 

AA248998 Hs,31246 

AB023230 Hs,96427 

AL120362 Hs,124165 

F13237 Hs.169388 

BE551196 Hs.1 14275 

AI560129 Hs.277523 
AW298067 

T16971 Hs.289014 



AW131636 Hs.191260 

BE540255 Hs.6994 

AK001407 Hs.88663 

AI061288 Hs.1 33437 

AI765021 Hs.210775 

AI434345 Hs.36908 

NM_016122 Hs.56148 

AI867679 Hs.1 48410 

L02911 Hs!l50402 

S83308 Hs.87224 

AL390179 Hs.137011 

N74530 Hs.21168 

AA009647 Hs.8850 

S65791 Hs.89764 

BE139590 Hs.1 22406 

BE536911 Hs.234545 

AK001666 Hs.1 89095 

AA406293 Hs.301622 

AF072873 Hs.1 14218 

AF216751 Hs.26813 

AA305627 Hs.1 39336 

AW05B674 Hs.44787 

AL133117 Hs.81376 

AF155827 Hs.203963 

AA314907 Hs.85950 

AI800271 Hs.129445 

AW39255S Hs.18878 



huntingtin interacting protein 1
ESTs

HSPC072 protein

hypothetical protein FLJ11046

ESTs, Highly similar to t(3;5)(q25.1;p34) fus

CDC5 (cell division cycle 5, S. pombe, homolo

centromere protein A (17kD)

ESTs, Weakly similar to partial CDS [C.elegan

ESTs, Moderately similar to ALU8_HUMAN ALU SU

exostoses (multiple) 2

staufen (Drosophila, RNA-binding protein) hom

ESTs 



Homo sapiens clone 24881 mRNA sequence 



Homo sapiens cDNA: FLJ23089 fis, clone U<IG070 
Sarcolemmal-associated protein
BMPR-Ib; bone morphogenetic protein receptor
ESTs, Weakly similar to R06F6.5b [C.elegans]

ESTs

thymosin, beta, identified in neuroblastoma c

ESTs; (HSA)PAP protein (programmed cell deat
Homo sapiens clone 23578 mRNA sequence 



gb:UI-H-BW[)4[]p.g-09-(HJI.s1 NCLCGAP_Sub6 Ho 



hypothetical protein FLJ10298

Homo sapiens mRNA; cDNA DKFZp564F093 (from cl

ESTs, Weakly similar to BACR37P7.g [D.melanog

ESTs

Homo sapiens cDNA: FLJ22044 fis, clone HEP091

hypothetical protein FLJ10545

ESTs, Moderately similar to gonadotropin indu



activin A receptor, type I

SRY (sex determining region Y)-box 5

Homo sapiens mRNA; cDNA DKFZp547P134 (from cl 



similar to SALL1 (sal (Drosophila)-like
ESTs 

CDA14 

ATP-binding cassette; sub-family C (CFTR/MRP)
Homo sapiens mRNA; cDNA DKFZp434O0227 (from c
Homo sapiens mRNA; cDNA DKFZp586L1121 (from c
hypothetical protein FLJ10339
ESTs

hypothetical protein FLJ12496
421977
429441 
449722 
431689 
425178 



400534 
417845 
423123 



AI472808 
W94197 



Homo sapiens cDNA: FLJ21663 fis, clone COL088
gb:ti70e07.x1 SoarBS_NSF_F8_9W_OT.PA.P_S1 Horn 
ribosomal protein L2S homolog 
lipophilin B (uteroglobin family member), pro
cyclin B1

HS.267S95 UDP-GalrbetaGlcNAc beta 1 ,3^alactosyltransfe 



AA305688 

H16097 Hs.161027 ESTs 

NM_003816 Hs.2442 

AI364997 Hs.7572 



AL1 17461 
NM_012247 
AA625207 
AW605267 
AF1 09298 
AI817225 



411630 U42349 

432842 AW674093 

413472 BE242870 

414699 AI815523 

412733 AA984472 

419790 U79250 

433377 AI752713 

449535 W15267 



Hs .82719 
Hs .124027 
Hs.264915 
Hs.7627 
Hs.1 18258 
Hs.1 70337 
Hs.1 58205 



451813 
454193 
418478 



Hs.76930 
Hs.74554 
Hs.93201 
Hs.43845 
Hs.23672 
Hs,226414 
Hs.237146 
Hs.1 2244 
Hs.46638 
Hs.32683 
AA524190 Hs.1 20777 
AW2914B8 Hs.1 17305 
NM.016117 Hs.27182 
BE141183 

U33945 Hs.1 174 



R64512 
AA831267 



418413 R95735 



BE518060 
AVS49748 
AW021691 
AA436736 



Hs.125845 
Hs.12407 
ls.173912 



Hs.3804 
Hs.183171 
Hs.40735 
HS.10S039 
HS.12534S 
Hs.6111 
Hs.9663 



AI811807 
AW673847 
AA641376 



443271 
421437 
401644 
405095 
418417 
4208C7 
429529 

431130 
453403 
442768 
413430 
424081 



AA454190 H5.193811 
AI217477 Hs.194591 
NM_006103 Hs.2719 
BE466639 Hs.61779 
AL048534 Hs.48458 
R22479 HS.246S0 
Hs.139120 
Hs.155956 
Hs.39384 
Hs.44298 



NM_0fl6413 

D90041 

AI077715 



Homo sapiens mRNA; cDNA DKFZp586F1822 (from c 
SELENOPHOSPHATE SYNTHETASE ; Human seleniu
Homo sapiens cDNA FLJ12908 fis, clone NT2RP20

CGI-60 protein

Prostate cancer associated protein 1
ESTs 



basic leucine zipper nuclear factor 1 (JEM-I) 
Hs./l 1 1 9 Putative prostate cancer tumor suppressor 
Hs.279525 hypothetical protein PR02605 
Hs.75379 solute carrier family 1 (glial high affini

synuclein, alpha (non A4 component of amyloid
KIAA0080 protein

glycerol-3-phosphate dehydrogenase 2 (mitocho



ESTs 

low density lipoprotein receptor-related prot
ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAM
Homo sapiens cDNA FLJ14234 fis, clone NT2RP40
Homo sapiens cDNA: FU23S81 fis, clone LNG136 
■ 1 open reading frame 8; fetal br 



ESTs, Wealdy similar to ELU.HUfllAN RNA POLYME 



Homo sapiens cDNA: FU23068 fis, clone I>1G055 
ESTs, Wealdy similar to antigen of ttie monoci 
hypotlieticai protein FU 10997 



guanylate binding protein 1, interferon-induc
gb:DKF2p434H0623_r1 434 (synonym: htesJ) Ho 
SRY (sex determining region Y)-box 2, partial



Hs.282882 ESTs 



eukaryotic translation initiation fac



Hs.295901 ESTs 



DKFZP564C1940 protein 
Homo sapiens cDNA: FU22002 fis, clone HEP066 
frizzled (Drosophila) homolog 3
solute carrier family 34 (sodium phosphate),
ESTs 

KIAA0307 gene product 

programmed cell death 6-interacting protein

gb:MR4-ST0062-031 199-018<I06 ST0062 Homo sapt 

Homo sapiens cDNA FLJ10570 fis, done Mr2RP20 

early growth response 3

cyclin A1

multiple POZ domain protein 
F-box only protein 21 

solute cam'er family 5 (Inositol transporter 

Homo sapiens cI3NA FU12S34 Us, clone NT2RM40 



Hs.191840 ESTs 



ADP-ribosylalion fador domain pratein 1, S4I< 



gb:yi65e02.r1 Soares placenta Nb2HP Homo sapi 
ESTs 

ESTs, l^oderately similar to reduced expressio 
ESTs 

epididymis-specific, whey-acidic protein type
Homo sapiens cDNA FU13591 fis, done PLACE10 
ESTs, Weakly similar to ALU8.HUI1MN ALU SUBFAM 
Homo sapiens cDNA FLJ13047 fis, clone NT2RP30 
ribonuclease P (30kD)
NAT1; 



421175
420324 
417531 
458924 
400195 
401480 
410360 






AI879099 Hs.102397 

AF163474 Hs.96744 

NM_003157 Hs.1087 

BE242158 Hs.24427 



Hs.10592 
Hs.99785 

AA436989 Hs.121017 
AI917494 Hs.131329 
NM_002214 I .184908 
NM 001173 Hs.267831 



437144 
452728 
430447 
440594 
408933 



AA210863 Hs.3532 



AA059013 
BE178110 
AF155666 



432824 AK001783 



451C73 
450377 
414343 
448807 
442821 
426300 
418068 



Hs.279012 
Hs.124577 
Hs.18685 



DKFZP56601646 protein 

0 

0 

gb:h]21g03j(1 NCLCGAP_U8HoniosapienscDNA 

ESTs 

ESTs 

H2A histone family; member A

ESTs 

integrin, beta 8 

Homo sapiens cDNA FU12952 fis, done NT2RP20 



Hs.239708 ESTs 

Hs.241451 SWI/SNF 

Hs.1 26036 ESTs 

Hs.22607 ESTs 

Hs.1 73374 ESTs 



411402 
450447 
414706 



Hs.25010 
Hs.76989 
Hs.283978 
Hs.148135 
Hs.21291 




ESTs, Weakly similar to Similar to Ena-VASP i 
ESTs 

neuronal pentraxin II



ESTs; Weakly similar to Gag-Pol polyprotein [
heparan sulfate (glucosamine) 3-O-sulfotransf
tryptophan rich basic protein
ESTs, Weakly similar to RG1 MOUSE FIG-1 PR( 



gb.-C(V2-HT0577-010500-165-g04 HT0577 Homo sapi 
ESTs 

transforming growth factor, beta 2 
Homo sapiens cDNA: FU22729 fis, clone HS1156 
P0P4 (processing of precursor , S. ceievisiae 



Hs.8752 

HS.16922B 

Hs.293902 



Homo sapiens cDNA: FLJ22648 fis, clone HSI073 
endometrial bleeding associated factor (left-
KIAA0305 gene product
hypothetical protein FLJ10074
sterol-C5-desaturase (fungal ERG3, delta-5-de
hypothetical protein FLJ10549



eukaryotic translation initiation factor 4E



gb:EST187095 Colon carcinoma (HCQ ceil line 
Human clone A9A2BR1 1 (CAC)n;(GTG)n repeat-oon 
hypotlietical protein FU12085 
liypotlietical protein FU10921 
ESTs 

Homo sapiens mRNAfor KIAA1413 protein, parti 



ESTs 

Putative type II membrane protein

delta-like homolog (Drosophila)

ESTs, Weakly similar to prolyl 4-hydroxylase

kinesin-like 6 (mitotic centromere-associated

yes-associated protein 65 kDa 

Homo sapiens cDNA FU14269 fis, clone PLACE 

NRAS-related gene 

hypotlietical protein P1 5-2 

KIAA0097 gene product 

ESTs; KIAA0738 gene product 

ESTs 

mitogen-activated protein kinase kinase kinas



411560 AW851186 



. gb:IL3-Cn)220.150200-071.H05Cr0220Ho 
415115
453468 
441205 
452693 
417389 
448105 
451522 
440048 
419359 



413775 
417177 
427943 
439107 
447268 
412604 
427134 



Hs.28465 
Hs.127751 
HS32990 
Hs.176904 



Homo sapiens oDNA aJ14236 Ss. done NT2RP40 
Homosapiens cDNA: FU210()7fls, clone CAE038 
KIAA0155 gene product 

gb:RC4-HTO469-23030(H)1+«10 HT0469 Homo sapi 
Homo sapiens cDNA: FU213S9 lis, cione HEP024 



AA897461 
AL043202 
AL1 37578 



H3.158469 
Hs.90073 
Hs.27607 

Hs.151380 
Hs.40403 
Hs.47144 



AW409934 H3.75528 
NM_00445B Hs.81452 
AW969075 



AA398409 
AI311127 
AW137159 



AI19g738 
N87079 
AA347746 



Hs.47144 
Hs.173561 
Hs.125522 
Hs.146151 



zinc finger protein 228



ESTs, Weakly similar to TMS2_HUMAN TRANSMEMBR

hypothetical protein FLJ21657

ESTs, Weakly similar to envelope protein [H

chromosome segregation 1 (yeast homolog)-like

Homo sapiens mRNA; cDNA DKFZp564N2464 (from c 

0 

ESTs 



Hs.27895 ESTs 



nucleolar GTPase 

fatty-acid-Coenzyme A ligase, long-chain 4
gb:EST371145 MAGE resequences, MAGE Homo sapi



Homo sapiens cDNA: FLJ22418 fis, clone HRC085



HSPC067 protein 

Homo sapiens HSPC316 mRNA partial cds 

hypothetical protein 

EST cluster (not in UniGene) 



gb:yq30f05.r1 Soares fetal liver spleen 1NFLS
ESTs 

ESTs, Weakly similar to unnamed protein produ 
NADH dehydrogenase (ubiquinone) 1 beta subcom
ESTs, Weakly similar to KIAA1015 protein [H.s






423687 
428372 
439741 
441447 



417837 
450516 
407796 
419200 

445679 
435014 
446619 
439170 



AA934077 Hs.126980

R44433 Hs.106614

AI744361 Hs.205591

NM_000314 Hs.10712

Y15221 Hs.103982



ESTs, Highly similar to Z117_HUMAN ZINC FINGE

hypothetical protein FLJ22104

Homo sapiens mRNA full length insert cDNA clo



AI989517 
BE219771 
AA262105 
AA158177 



Hs.23448
Hs.16529
Hs.181605
Hs.237146

Hs.4094
Hs.118722
Hs.42746



AA195509 
AW966405 
AL049227 
AI343868
BE560898
AU076643 
AA332365 
AI537278 
AW086180 



Hs.124776 
Hs.58800 
Hs.10026 



445817 NM_003642 Hs.13340



403532 
409517 
414304 



F12863 
AI969716 
AA4421S3 
AW074266 



ESTs, Weakly similar to zinc finger protein P 
phosphatase and tensin homolog (mutated in mu
small inducible cytokine subfamily B (Cys-X-C
Homo sapiens cDNA: FLJ22621 fis, clone HSI055



Homo sapiens cDNA FLJ14234 fis, clone NT2RP40

Homo sapiens cDNA FLJ14208 fis, clone NT2RP30

fucosyltransferase 8 (alpha (1,6) fucosyltran
ESTs 

transforming growth factor, beta 1
NIF3 (Ngg1 interacting factor 3, S.pombe homo
lymphocyte activation-associated protein
prefoldin 5 

Homo sapiens mRNA; cDNA DKFZp564N1116 (from c
Homo sapiens cDNA FLJ12488 fis, clone NT2RM20

ribosomal protein L17 isolog

secreted phosphoprotein 1 (osteopontin, bone

ESTs 

DKFZP434D193 protein 

ESTs, Weakly similar to KIAA1392 protein [H.s

histone acetyltransferase 1



Hs.7678 

Hs.63176 

Hs.54668 

Hs.165998 

Hs.143399 

Hs.126695 

Hs.125395 

Hs.13034 
Hs.104744 
Hs.23071 



gb:HSC3FE081 normalized infant brain cDNA Hom
ESTs

ESTs, Weakly similar to AF208855 1 BM-013 [H.







408418 

442025 

417006 
407881 
444755 



415947 
410519 
421987 
440046 



GalNAc alpha-2,6-sialyltransferase I, long f



Homo sapiens cDNA: FLJ23442 fis, clone HSI009

kallikrein 6 (neurosin, zyme) 

TONDU 

ESTs; Weakly similar to WASP-family protein [ 
paired box gene 8 

ESTs, Weakly similar to 712540 hypothetical p

ESTs 



Hs.5199 
Hs.26145 
Hs.60758
Hs.193914 
NM_016625 Hs.191381 



H88717 

Z99399 

T67012 

AA627098 

AA830B11 



AW673606 
AW072003 
AA431791 



441020 W79283 
422363 T55979 
413010 AA393273 



410486 
434540 
409178 



418735 
414245 
410909 
434926 

429017 
447072 
426514 
448133 
418792 
427528 
402077 
440671 
419390 



UC4045 

AW612264 

AI133161 

AW402306 

AL121278 

AW603985 

F13036 



Hs.26706 

Hs.93796 

Hs,28C858 

Hs.7838 

Hs.226440 

Hs.161712 

Hs.35962 

Hs.115474

Hs.75133

Hs.27842 

Hs.193424 

Hs.5184 

Hs.50915 

Hs.125316 

Hs.39487

Hs.149722 

Hs.42743 

Hs.78934 

Hs.131705 

Hs.286131 

Hs.6877 

Hs.179662 
Hs.27373 
Hs.44609 
Hs.75850 
Hs.53112 
Hs.50252 



AA723157 
AB037805 
AU077143 



X17360
M31126 
AA306105 
AA449015 
AI807036
BE149866 
BE298210 
NM_016010 Hs.118821 
AA256253 Hs.120817 
AL121031 



Hs.98133 ESTs 



ESTs, Highly similar to AF161349 1 HSPC086 [H



ESTs, Weakly similar to I38428 T-complex prot

ESTs

ESTs

hypothetical protein FLJ10525
B-factor, properdin

replication factor C (activator 1) 4 (37kD)
KIAA1435 protein

ESTs, Weakly similar to C04.2 [C.elegans]

aspartyl-tRNA synthetase

heparan sulfate (glucosamine) 3-O-sulfotransf

ESTs 



ESTs, Highly similar to XPB_HUMAN DNA-REPAIR

makorin, ring finger protein, 1

Homo sapiens clone 24881 mRNA sequence

ESTs

ESTs

replication factor C (activator 1) 3 (38kD)

transcription factor 6-like 1 (mitochondrial

hypothetical protein FLJ11210

ESTs, Weakly similar to KIAA1064 protein [H.s

TH1 drosophila homolog



Hs.238995 ESTs 



Hs.17 

Hs.130054 
Hs.278255 
Hs.272620 
Hs.50785 
Hs.286145 
Hs.101619 
Hs.14831 



WAS protein family, member 1

ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAM

Homo sapiens HSPC283 mRNA, partial cds

ESTs 



tyrosylprotein sulfotransferase 1

bone morphogenetic protein 7 (osteogenic prot

folate receptor 1 (adult)

KIAA1384 protein

minichromosome maintenance deficient (S. cere
ESTs

homeo box D4

pregnancy specific beta-1-glycoprotein 9
SEC22, vesicle trafficking protein (S. cerevi
SRB7 (suppressor of RNA polymerase B; yeast)



Hs.32556 KIAA0379 protein
456844  AI264155   Hs.152981  CDP-diacylglycerol synthase (phosphatidate cy   3.0
414941  C14865     Hs.182159  ESTs                                            3.0
407807  AL031427   Hs.40094   Human DNA sequence from clone 167A19 on chrom   3.0
414725  AA769791   Hs.120355  Homo sapiens cDNA FLJ13148 fis, clone NT2RP30   3.0
444420  AI148157   Hs.146766  ESTs                                            3.0
431742  NM_016652  Hs.268281  CGI-201 protein                                 3.0
412519  AA196241   Hs.73980   troponin T1, skeletal, slow                     3.0
418348  AI537167              Homo sapiens cDNA: FLJ23560 fis, clone LNG098   3.0
444261  AA298958   Hs.10724   MDS023 protein                                  3.0
457465  AW301344   Hs.195969  ESTs                                            3.0
443933  AI091631   Hs.135501  Homo sapiens two pore potassium channel KT3.3   3.0
442150  AI368158   Hs.128864  ESTs                                            3.0
414883  AA926960   Hs.77550   CDC28 protein kinase 1                          3.0
442879  AF032922              syntaxin binding protein 3                      3.0
437949  U78519     Hs.41654   ESTs                                            3.0
403515                        0                                               3.0
403864                        0                                               3.0
407785  AW207285   Hs.98279   ESTs                                            3.0
426199  AA371865              ESTs                                            3.0
426324  AW291787   Hs.200933  ESTs
427738  NM_000318  Hs.180612  peroxisomal membrane protein 3 (35kD, Zellweg   3.0
427837  U87309     Hs.180941  vacuolar protein sorting 41 (yeast homolog)     3.0
439430  AF124250   Hs.6554    breast cancer anti-estrogen resistance 3        3.0
442039  AW276240   Hs.128362  ESTs, Weakly similar to p80 [R.norvegicus]      3.0
446978  NM_001938  Hs.16697   down-regulator of transcription 1, TBP-bindin   3.0
452431  U88879     Hs.29499   toll-like receptor 3                            3.0
452841  T17431     Hs.65412   DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide    3.0
432114  AL036021   Hs.225597  ESTs                                            3.0
445640  AW969626   Hs.31704   ESTs, Weakly similar to KIAA0227 [H.sapiens]    3.0
442607  AA507576   Hs.288361  KIAA0741 gene product                           3.0
453920  AI133148   Hs.36602   I factor (complement)                           3.0
430000  AW205931   Hs.99598   ESTs                                            3.0
429164  AI688663   Hs.116586  ESTs                                            3.0
463331  AI240665   Hs.8896    ESTs                                            3.0
448663  BE614599   Hs.106823  H.sapiens gene from PAC 42616, similar to syn   3.0
425776  U25128     Hs.159499  parathyroid hormone receptor 2                  3.0
401714                        0                                               3.0
400903                        0                                               3.0
428428  AL037544   Hs.184298  cyclin-dependent kinase 7 (homolog of Xenopus   3.0
443761  AI525743   Hs.160603  ESTs                                            3.0
451640  AA195601   Hs.26771   Human DNA sequence from clone 747H23 on chrom   3.0
442580  AI733682   Hs.130239  ESTs                                            3.0




427943 284802_1 
431965 33959_2 
436812 427323_1 



1223878_1 
1245515_1 



R86913 R86901 H25352 R01370 H43764 AW044451 W21298 
AIJ043212 AAa77S75 AAa77655 R195Q2 BE5454S7 AI638421 R14093 
AW663690 

BE298210 AI672315 AW086489 BE298417AA455921 AA902537 BE327124R14963 AA085210 AW274273 AI333S84 AI369742 AI039658 
AI885095A147647QAI287650AI885299AI98S381AWS92624AW340136AI266556AA456390AI310815AA484951 
AWB51186AW996967BE143456 
Z2487B AA494098 F13654AA494040 AA143127 
R77182R77197R80484 



AI830417AA236612 

AA426562 AI880208 AA346646 N22655 AW81 1775 AW81 1786 
AW959075 W06838AA417B63 
BE175190 BE003348 

AW29B067 AA731645 AA8101 01 AW1 94180 AI690673 AW978773 

AI950087 N7020B R97040 N36309 AI3081 19 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598 
AA25187S AI820501 AI820532 W87891 T85904 U71456 T82391 BE328S71 T75102 R34725 AAS84922 BE328517 AI219788 AA884444 
N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA4BB964 AA283144 AI8g0387 AI950344 
AI741 346 AI689062 AA2B291 5 AW1 02898 AI872193 AI763273 AW173586 AW150329 A1653832 AI762688 AA98B777 AA488892 A1356394 
AW103B13AI539642AA6427B9AA856975AWS05512AI961530AW629970BE612881AW276997AW513601 AW512843AA044209 
AW85653BAA180009AA337499AW961101AA251669AA251874AI819225AW205862AI683338AI8S8509AW276905AI633006AA972584 
AA908741 AW072629 AW513986 AA293273 AA969759 1475628 N22388 HB4729 H60052 T92487 AI022058 AA780419 AAS51 005 W80701 
AW613456 A1373032 AI564269 F00531 HB3488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW9S9581 T63226 F04005 
BE164500 AA832198 BE164502 
F12863AI377223 T75C9g 

BE141183AW1 78167 AW1781 62 AW1 78156 AW178172AW845893AW178159AW178222AW1 78213 AW1 73215 AW178090AW178C91 
AW178161 AW173207AW178210AW178214AW178212BE140918 BE140917AW178135AW178205 AW178209AW178223AW173220 
AW178206AW178203AW178165AW178168AW178160AW178136AW845878AW178131AW178138AW178105AW845894AW178129 
AW845810AW845828AW178216AW178112AW178211AW178224BE140915AW178221AW178130AW178134AW178096AW178108 
AW178133 AW17B164 AW178218 AW178171 AW178157 AW178153 AW17B103 BE141189 AW178170 AW845816 BE141586AW1 78156 
AW178104AW178163AW1 78093 AW1 78208 AW178137AW178140AW178219BE141S92AW845901BE141S80AW17815SBE141S98 
BE140957 

AW807073 AW807055 AW807067 AW807276 AW807030 AW807363 AW84S892 AW807091 AW807275 AW807284 AW8072B7 AW845891 

AW807195AW8Q7271 

BE141714 AW845993 AW845989 
the Genbank Identifier (GI) numbers. 'Dunham I. et al.' refers to the publication entitled "The DNA sequence of



401508 
401517 
401644 
401714 
402077 
402222 



Ref 

6981826 
8118496 
2911732 
9719705 
7321503 
7534110 
7677912 
8576138 
6715702 
8117414 



403381 
403515 
403729 
403864 
404108 
404232 
404552 
404567 



405095 
406069 
406117 



8918414 Plus 



7543752 
7709019 
8247074 
8218045 
7243881 
7249169 
8705107 
7341766 



17982-18115,20297-20466 
176341-176452 



110779-110983 

29278-29770 

82655-83959 

96484-96681 

65014-55195 

3261-3834,3939-4269 

110326-110491 

82274-32443 

101532-101852,102006-102263 

26009-25178 

173358-179553 



51753-51890,79290-79445 

63603-64942 

71800-71956 



101320-101501 
110443-110733 
100915-101087 

37999-38145,38652-38998,39727-39872,40557-40674,42361-42450 



Table 11A lists about 222 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to encode extracellular or cell-surface proteins. These were
selected as for Table 10A, except that the ratio was greater than or equal to 2.0, and the predicted protein contained a structural domain that is indicative of extracellular
localization (e.g., ig, fn3, egf, 7tm domains, signal sequences, transmembrane domains). Predicted protein domains are noted.
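The selection rule described above (ratio of at least 2.0 plus at least one domain indicative of extracellular localization) can be sketched in Python as follows. This is only an illustrative sketch, not the procedure actually used to build the table: the `GeneRow` type, the `is_candidate` function, and the exact contents of `EXTRACELLULAR_DOMAINS` are assumptions introduced here, and the last two rows are hypothetical placeholders rather than entries from the table.

```python
from dataclasses import dataclass

# Assumed set of PFAM column labels treated as "indicative of extracellular
# localization" (SS = signal sequence, TM = transmembrane domain). The real
# list used for Table 11A is not given in full; this set is illustrative.
EXTRACELLULAR_DOMAINS = {"SS", "TM", "ig", "fn3", "EGF", "7tm_1", "7tm_2", "7tm_3"}


@dataclass
class GeneRow:
    pkey: int            # Primekey
    accession: str       # Exemplar Accession
    unigene_id: str      # UniGene ID
    title: str           # UniGene title
    pfam_domains: str    # comma-separated predicted domains, as in the table
    ratio: float         # ratio tumor vs normal tissue


def is_candidate(row: GeneRow, min_ratio: float = 2.0) -> bool:
    """Return True if the row meets both stated criteria."""
    domains = {d.strip() for d in row.pfam_domains.split(",") if d.strip()}
    return row.ratio >= min_ratio and bool(domains & EXTRACELLULAR_DOMAINS)


rows = [
    # Two rows as they appear in Table 11A.
    GeneRow(438018, "AK001160", "Hs.5999", "hypothetical protein FLJ10298", "TM", 4.9),
    GeneRow(414706, "AW340125", "Hs.76989", "KIAA0097 gene product", "TM", 3.6),
    # Hypothetical rows (not from the table) that fail one criterion each.
    GeneRow(1, "EXAMPLE_A", "Hs.0", "hypothetical low-ratio row", "TM", 1.5),
    GeneRow(2, "EXAMPLE_B", "Hs.0", "hypothetical row without listed domain", "pkinase", 3.0),
]

selected = [row.pkey for row in rows if is_candidate(row)]
print(selected)
```

Some rows in the table carry only enzymatic or receptor domain labels (for example `Activin_recp,pkinase`), so a fixed label set such as the one assumed here would need to be extended before it reproduced the full table.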

TABLE 11A: ABOUT 222 UP-REGULATED GENES ENCODING EXTRACELLULAR/CELL SURFACE PROTEINS, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES

Pkey: Primekey 

Ex.Accn: Exemplar Accession 

UGID: UniGene ID

Title: UniGene title

PFAM domains: predicted protein structural domains 
ratio: ratio tumor vs normal tissue 



427344 
432677 
404567 

409928 
407001 
453370 
400298 
431725 



UGID 

Hs.72472 

Hs.2258

Hs.179729

Hs.5378



445142 
415138 
438167 
452097 
449048 
425371 
407945 



AW850417 
AA279490 
AF167326 
NM_000869
NM_004482 
NM_015902 
AJ245671 
AL137163 



AA032279 

X65724 

AF002246 

D16532 

AF100781 

M13509

W67883 

AW131838 

A1754593 

C18356 



Title 



Matrix Metalloproteinase 10 (Strom
collagen; type X; alpha 1 (Schmid m
spondin 1, (f-spondin) extracellular m
G protein-coupled receptor 64



PFAM domains 



Hs.254020 ESTs, Moderately similar to unnam

Hs.86368 calmegin

Hs.184786 TBP-interacting protein

Hs.2142 5-hydroxytryptamine (serotonin) rec

Hs.278611 UDP-N-acetyl-alpha-D-galactosami

Hs.278428 progestin induced protein (P



Hs.61635 

Hs.210863
Hs.73729
Hs.194678
Hs.83169
Hs.137476
Hs.172792



hypothetical protein dJ473B4 
Human thrombospondin-1 gene, par
ESTs, Moderately similar to translat
STEAP1 
Nomed 



Hs.145968 ESTs 
Hs.78045 
Hs.24286 
Hs.27916 



Hs.155981 
Hs.606 
Hs.151254 
Hs.97206 



very low density lipoprotein recepto 
WNT1 inducible signaling pathway
Matrix metalloprotease 1 (interstitia 
KIAA1C51 protein 

ESTs, Weakly similar to hypothetica



tor2TFP 



ATPase, Cu++ transporting, alpha p 



SS,calreticulin
TM 

TM,neur_chan 

TM,Glycos_transf_2,Ri

TM,HECT,zf-UBR1

SS.MAf«,EGF 

TM,MSP_domain

TSPN,vwc,tsp_1,EGF

ABC_tran,ABC_membr 

TM 

SS,Cys_knot 
TM,fn3,ig

TM,ldl_recept_a,ldl_rec
SS,IGFBP,Cys_knot,tsp



Pep_M12B_propep,Rep 
TM 

Cadherin_C_term,cadhe
Kunitz_BPTI,G-gamma
7tm_1 

Pep_M12B_propep,Rep 



TM,E1-E2_ATPase,Hy

SS,trypsin

TM,ENTH,I_LWEQ
413384


NM_000401




exostoses (multiple) 2




5.3 


425154 


NM_001851


Hs.154850


collagen, type IX, alpha 1


Collagen,TSPN


5.2 


411946 


AL033527 


Hs.92137 


v-myc avian myelocytomatosis viral


TGF-beta,TGFb_propep


5.1 


415539 


Af733881 


Hs.72472 


BMPR-Ib; bone morphogenetic pro


pkinase,Activin_recp 


5.1 


438018 


AK001160 


Hs.5999 


hypothetical protein FLJ10298


TM 


4.9 


424539 


L02911 


Hs.150402


activin A receptor, type I


Activin_recp,pkinase


4.8 


450375 


AA009647 


Hs.8850 


a disintegrin and metalloproteinase d


disintegrin,Reprolysin,P


4.7


451684 




Hs.26813 


CDA14 




4.6 


400296 


AA306627 


Hs.139336 


ATP-binding cassette; sub-family C 


TM,ABC_tran,ABC_m


4.6


429597 


NM_003816 


Hs.2442 


a disintegrin and metalloproteinase d


TM


4.5


400534 






predicted exons 


TM,KRAB,zf-C2H2




425506 


NM_003666


Hs.158205


basic leucine zipper nuclear factor 1


TM,Folate_carrier


4.6


413472 


BE242870 


Hs.75379 


solute carrier family 1 (glial high aff


TM,SDF 




452028 


AK001859 


Hs.27595 


low density lipoprotein receptor-rela
hypothetical protein FLJ10997


SS,ldl_recept_b,ldl_rece
Zn_carbOpept,Propep_M


4.4
4.3




BE391804 




thrombospondin 1 

guanylate binding protein 1, interfer


EGF,TSPN,tsp_1,tsp_3,
TM,GBP


4.3
4.2


407872 


AB039723 


Hs.40735 


frizzled (Drosophila) homolog 3


Frizzled,Fz,7tm_2


4.2


421502 


AF111855 


Hs.105039


solute carrier family 34 (sodium pho


TM,Na_Pi_cotrans


4.2


412494 


AL133900


Hs.792 


ADP-ribosylation factor domain pro 
disintegrin protease 


arf,zf-B_box,zf-C3HC4


4.0


405095 


NM_014479 


Hs.145296 


Reprolysin,disintegrin


4.0


431130 


NM_006103


Hs.2719


epididymis-specific; whey-acidic pro


SS,wap 




407792 


AI077716 


Hs.39384 


putative secreted ligand homologous 




4.0


408829 


NM_006042 


Hs.48384 


heparan sulfate (glucosamine) 3-O-s


TM 


as 


450581 


AF081513 


Hs.25195 


endometrial bleeding associated fac


SS,TGF-beta,TGFb_pro


3.7 


432712 


AB016247


Hs.288031


sterol-C5-desaturase (fungal ERG3,


TM,Sterol_desat


3.7 


450447 


AF212223 


Hs.25010 


hypothetical protein P15-2 


TM,ANF_receptor,guan


ae 


414706 


AW340125 


Hs.76989 


KIAA0097 gene product 


TM 


3.6 


417389 


BE260964 


Hs.82045 


Midkine (neurite growth-promoting


TM,PTN_MK


3.6


400666 


X07820 


Hs.2258 


Matrix Metalloproteinase 10 (Strom


SS,hemopexin,Peptidas


3.5 


406400 


AA343629 


Hs.104570 


kallikrein 8 (neuropsin/ovasin)


SS,trypsin




407864 
452822 


AF069291
X85689 


Hs.288617 


chromosome 8 open reading frame 1 
Homo sapiens cDNA: FLJ22621 fis,


TM,FHA,BRCT
EGF,fn3,pkinase


3.4

3.4 


446913 


AA430650 


Hs.16529


transmembrane 4 superfamily memb


TM,transmembrane4


3.4 




AA158177 


Hs.118722


fucosyltransferase 8 (alpha (1,6) fuc


SS 


3.4 


423161 


AL049227


Hs.124776


Homo sapiens mRNA; cDNA DKFZ


cadherin,Cadherin_C_te


3.3 




AW899053 


Hs.76917 


F-box only protein 8 


TM,Sec7 


3.3 


415530 


U62801 


Hs.79361 


kallikrein 6 (neurosin, zyme)


SS,TM,trypsin


3.3 


401197 








aif.Ets 


3.3 


436525 


AA721428 


Hs.26145 


Homo sapiens cDNA FLJ14127 fis,


TM 


3.2 


452943 


BE247449 


Hs.31082 


hypothetical protein FLJ10525


TM 


3.2 


411393 


AW797437 


Hs.69771 


B-factor, properdin 


SS,sushi,trypsin,vwa,fn3


3.2 


407881 


AW072003 


Hs.40968 


heparan sulfate (glucosamine) 3-O-s


SS 


3.2 


418336 


AI655499 


Hs.161712 


ESTs 


pkinase,Activin_recp 


3.2 


409178 


BE393948 


Hs.50915 


kallikrein 5 


SS,trypsin






AI133161 




CGI-101 protein 


TM 




447072 




Hs.17279






3.1


426514 


BE616633 


Hs.301122 


bone morphogenetic protein 7 (osteo


SS,TGFb_propeptide,T


3.1


448133


AA723157 


Hs.73769 


folate receptor 1 (adult) 


TM 


3.1 


406687 


M31126 


Hs.272620 


pregnancy specific beta-1-glycoprot


SS,Peptidase_M10,hem


3.1 


456844 


AI264155


Hs.152981


CDP-diacylglycerol synthase (phosp


TM,Cytidylyltrans




414725 


AA769791 


Hs.120355


Homo sapiens cDNA FLJ13148 fis,


SPRY,7tm_1


3.0


407735 


AW207285 


Hs.98279 


ESTs 


Sema,lg 


3.0 


427738 


NM_000318


Hs.180612


peroxisomal membrane protein 3 (35


TM,zf-C3HC4


3.0 


452431 


U88879 


Hs.29499 


toll-like receptor 3
I factor (complement)


TM,TIR,LRRCT


3.0


453920 


AI133148 


Hs.36602 


ldl_recept_a,trypsin,SRC


3.0


453331 


AI240665 


Hs.8895 


ESTs 


disintegrin,Reprolysin,P 


3.0


425776 


U25128 


Hs.159499 


parathyroid hormone receptor 2 


TM,7tm_2


3.0


428428 




Hs.184298 


cyclin-dependent kinase 7 (homolog


TM,pkinase


3.0


407910 
408380 


AA650274 
AF123050 


Hs.41296 
Hs.44532 


fibronectin leucine rich transmembra

diubiquitin


TM,LRRCT,LRRNT,LR
TM,ubiquitin,7tm_3,AN


2.9 


407783 


AW996872 


Hs.172028


a disintegrin and metalloproteinase d


disintegrin,Reprolysin


2.9








androgen receptor (dihydrotestostero
wingless-type MMTV integration sit


TM,Androgen_recep,ho
cadherin,Cadherin_C_te


2.9
2.9 




AA430064 


Hs.220929


ESTs, Moderately similar to ARF-fa 


arf 


2.9 


419452 


U33635 




PTK7 protein tyrosine kinase 7 


TM,pkinase,ig 


2.9 






Hs.28792 




TGFb_propeptide,TGF- 




420440


NM_002407


Hs.97644


mammaglobin 2


SS,Uteroglobin


2.9


418848 


AI820961 


Hs.193455


ESTs


pkinase,Activin_recp


2.9 


421991 


NM_014913


Hs.110488 


KIAA0990 protein 






433190 


M26901 


Hs.3210 




SS,asp 


2.9




NM_005095 


Hs.150390 


zinc finger protein 262




2.8 


433002 


AF048730 


Hs.279906 


cyclin T1


SS 


2.8 


444342 


NM_014398 


Hs.10887 


similar to lysosome-associated mem




2.8


430598 


AK001764 


Hs.247112 


hypothetical protein FLJ10902


TM


2.8 


428450 


NM_014791 


Hs.184339 


KIAA0175 gene product 


TM,pkinase,KA1 


2.8 


450171 


AL133661


Hs.24583


hypotheticai protein OKFZpmXa 


TM 


2.8 


423654 


M90516


Hs.1674


glutamine-fructose-6-phosphate tran


TM,GATase_2,SIS 


2.8 


430016 


NM_004736 


Hs.227655 


xenotropic and polytropic retrovirus


TM 


2.8 


417866 
424894 


AW067903 
H83520 


Hs.82772 
Hs.153678 


collagen, type XI, alpha 1 
reproduction 8 


Collagen,COLFI,TSPN
SS,UBX


2.8
2.8


430651 


AA961694 


Hs.105187 


kinesin protein 9 gene 


SS 


2.7 


414853 


U31115 


Hs.77501 


sarcoglycan, beta (43kD dystrophin-


TM


2.7


448595 


AB014544 


Hs.21572 


KIAA0644 gene product


TM,LRRCT,LRR





AK001269 


Hs.30738 










AA834626 




RAD54 (S.cerevisiae)-like


SSAnSjMlilerat 






AI&23693 


Hs.191533 




Cation_efflux




434815 


AF155582 


Hs.241489 


Core1 UDP-galactose:N-acetylgalact
TMPRSS3a mRNA for serine protea
hypothetical protein


trefoil,trypsin
^arf 




448402 


BE244226 


Hs.21094 


RAB18, member RAS oncogene fam




2.6


421802 


BE261458 


Hs.108408 


CGI-78 protein 










Hs.29202 


G protein-coupled receptor 34






417742 


R64719 




gl):EST22il11WATM1Homosapie 


anWeath.RHD,TIG 


2.6




NM_006338 




glioma amplified on chromosome 1


TM,ig,LRf^LRRNT,LR 










platelet derived growth factor C


TM,PDGF,CUB 




420079 


NM_014051 




PTD011 protein 








X80700 




pre-B-cell leukemia transcription fac


homeobox,igA:yltransf 




432350 


NM_005865


Hs.274407


protease, serine, 16 (thymus)




2.5




AA129547


Hs!mi?o 


met proto-oncogene (hepatocyte gro
interleukin 1 receptor, type I


pkinase,Sema,Plexin_re
SS,TIR,ig




422530 


AW972300 




bone marrow stromal cell antigen 2


TM


2.5


433929 
443562 


AI375499 


Hs.27379 
Hs.9599 


ESTs 

solute carrier family 25, member 13


EGF,ldl_recept_a,ldl_re
TM,mito_carr


2.5




X00442 






sushi,trypsin




417576 


AA339449 


Hs.82286


phosphoribosylglycinamide formyltr


AIRS,formyl_transf,GA


2.5 


449207 


AL044222 


Hs.23255 


nucleoporin 155kD


TM


2.5


416107 


AA173846 
AK000768 


Hs.79015 


antigen identified by monoclonal ant 
hypothetical protein FLJ20761


TM,ig 
TM,PH 


2.4
2.4


414812 


X72755 


Hs.77367 


monokine induced by gamma interfe


SS,IL8


2.4




AW953381 


Hs.18627


mutS (E.coli) homolog 6

ESTs, Weakly similar to G01447 GP


TM,MutS_C,MutS_N,P




430291 


AV660345 


Hs.238126 


CGI-49 protein 








BE278288 
BE387790 


Hs.26369


Lutheran blood group (Auberger b a 


ig 

TM,G-alpha 




412277 


BE277592 


Hs.73799 


ESTs 

guanine nucleotide binding protein ( 






BE439580 




small inducible cytokine subfamily A 






451806 


NM_003729


Hs.27076 


RNA 3'-terminal phosphate cyclase 












reticulocalbin 2, EF-hand calcium bi


SS,efhand






NM_003512 




H2A histone family, member L 


histone,Calc_CGRPJA4 




451668 


Z43948 


Hs.26789 


ASPIC (acidic secreted protein in ca 


SS,TM, 




400880 


M84349 


Hs.119663


CD59 antigen


SS,UPAR_LY6


2.3


421340 


F07783 


Hs.1369 


decay accelerating factor for comple 


SS,sushi 




443986 


AI381750 


Hs.283437 


HTGN29 protein 
syntaxin 7 








AW500305 




TM,Syntaxin 




440516 


S42303 




cadherin 2, type 1, N-cadherin (neur


HNH,cadherin,Cadherin






AI394145 


Hs.18048


melanoma anl'gen l*GE-10 






440704 


M69241 


Hs.162 


insulin-like growth factor binding pr 


SS,thyroglobulin_1,IGF










solute carrier family 11 (proton coup








AI734080 


Hs.104211 




Sema,ig






AF051152 




toll-like receptor 2 


SS,TIR,LRRCT,LRR 






J06070 


Hs.151738 


matrix metalloproteinase 9 (gelatina


SS,fn2,hemopexin,Pepti






NM_012211 


Hs.256297


integrin, alpha 11


TM,FG-GAP,vwa










procollagen-lysine, 2-oxoglutarate 5 


SS,Lysyl_hydro




400898 


AF220030 


Hs.125300


Homo sapiens tripartite motif protein 








AA242758 


Hs.79136 


Human breast cancer, estrogen regul 








AF245505 




Homo sapiens mRNA; cDNA DKFZ 


ig,LRRCT 




414809 


AI434599 


Hs.77356 


transferrin receptor (p90, CD71)


TM,PA,Ribosomal_S2


2.2 




NM_001551 


Hs.298023 


Homo sapiens aquaporin 5 (AQP5),






400277 


Y00281 


Hs.2230 


Human mRNA for ribophorin 1 


TM 




409317 


U20165 


Hs.53250 


bone morphogenetic protein recepto 


TM,pkinase 






AW103364 




H.sapiens activin beta-A subunit (ex


TGF-beta,TGFb_propep




451253 


H48299 


Hs^126 


claudin 10 


TM,PMP22_Claudin




429638 


AI9166S2 


Hs.211577 


Kinectin 1 (kinesin receptor)


TM 






NM_012453 


HS.S2515 


transducin (beta)-like 2


TM,WD40 




418414 


J04977 


Hs.84981


X-ray repair complementing defectiv 








AB037784 














Hs.82380


menage a trois 1 (CAK assembly fac








NM_002950




ribophorin 1 








NM_012421




rearranged L-myc fusion sequence 


TM,zf-C2H2 






AW450303 




bone morphogenetic protein recepto 


TM,Activin_recp,pkina






HI 5622 




gb:ym26c07.r1 Soares infant brain 1


RasGEF,PH,fibrinogen_ 




413063 


AL035737 


Hs.75184 


chitinase 3-like 1 (cartilage glycopro 


SS,ayco_hydro_16 






BE246444 


Hs.283685 


hypothetical protein FLJ20396








AF019612 


Hs.297007 




TM,Peptidase_M50








Hs.168670 


peroxisomal farnesylated protei


E1-E2_ATPase,Cation_ 






NM_000916 




oxytocin receptor 








AI690321




ESTs, Weakly similar to TWIK-rela 






421685 


AF189723 


Hs.105778


calcium transport ATPase ATP2C1


TM,E1-E2_ATPase,Hy


2.1 


424099 


AF071202 


Hs.139335 


ATP-binding cassette; sub-family C 


TM,ABC_tran,ABC_m 


2.1




AL035588 


Hs.153203 


MyoD family inhibitor










Hs.57813 


zinc ribbon domain containing, 1 


TFIIS


2.1


436135 


D85390 


Hs.5057


carboxypeptidase D


SS,Zn_carbOpept


2.1


420633 


NM_014581 


Hs.99526 


odorant-binding protein 2B 


TM,lipocalin


2.1 




BE378432 


Hs.95577 


cyclin-dependent kinase 4


pkinase,ank,ArfGap,PH


2.1


426156 


BE244537 


Hs.167382


natriuretic peptide receptor A/guany


TM,ANF_receptor,guan


2.0 


442711 


AF151073 


Hs.8645 


hypothetical protein 




2.0 


411872 


AW3273S6 


Hs.90918 


chromosome 11 open reading frame


TM 



2.0






427801 
430258 
431133 
431846 
404210 
435640 
447906 
412666 
417181 



415664 
416391 



U02478 

AF220053 

AL050062

AL080116 

L10123 

AA410943 



hypothetical protein PRO1068 
hypothetical protein FU20730 
KDEL (Lys-Asp-Glu-Leu) endoplas



Hs.100469 
Hs.54960 
Hs.19999 
Hs.74420 



NM_006799 Hs.72026



L14561 
AI955244 
NM_004939 



Hs.78546
Hs.121520
Hs.78580
Hs.202379
Hs.79284



D42063 Hs.179825
AI907114 Hs.71465 
AI479813 Hs.278411 



origin recognition complex, subunit 
surfactant protein A binding protein
BMPR-Ib; bone morphogenetic pro
protease, serine, 21 (testisin)
Homo sapiens clone 24411 mRNA s
HYPOTHETICAL 16.4 kDa PROTE
DEAD/H (Asp-Glu-Ala-Asp/His) bo



mesoderm specific transcript (mouse
ring finger protein 16
cleavage and polyadenylation specif
RAN binding protein 2-like 1
squalene epoxidase 
NCK-associated protein 1 



TM 

TM,pkinase,Activin_rec

TM,E1-E2_ATPase,Hy
TM,LRRCT

DEAD,helicase_C,SPRY

DNA_repair,Glyco_tran

TM,abhydrolase

SPRY,zf-C3HC4,zf-B_

TM 



TABLE 11B:

Pkey: Unique Eos probeset identifier number
CAT number:

Pkey     CAT Number   Accession
417742   1696282_1    R64719 Z44680 R12451
425189   247825_1     H16622 R17322 AA351959

TABLE 11C:

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. 'Dunham I. et al.' refers to the publication entitled "The DNA sequence of human chromosome 22", Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons



Pkey Ref Strand
400534 6981826 Minus 
401197 9719705 Plus 



Nt_position



indicative of enzymatic function or of being modulated by small molecules (e.g., pkinase, peptidase, ison 

TABLE 12A: ABOUT 67 UP-REGULATED GENES ENCODING EXTRACELLULAR/CELL SURFACE PROTEINS, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES

Pkey: Primekey 

Ex.Accn: Exemplar Accession

UGID: UniGene ID

Title: UniGene title

PFAM domains: predicted structural domains 
ratio: ratio tumor vs. normal 



400292 AA250737 

400289 X07820 

426427 M86699 

424905 NM_002497

433159 AB035898 

453370 AI470523 

418007 M13509 

425465 L18964 

409506 NM_006153 



424539 L02911 



431699 NM_001173 



416530 U62801 

411393 AW797437 

444755 AA431791 

418836 AI665499 

409178 BE393948 

406687 M31126 

453920 AI133148 

404653 AA923729 

419452 U33635 

418848 AI820961 

428450 NM_014791



UGID

Hs.72472 

Hs.2258 

Hs.169840 

Hs.163704

Hs.150587 

Hs.182356 

Hs.83169 

Hs.1904 

Hs.54589 



BMPR-Ib; bone morphogenetic pro
Matrix Metalloproteinase 10 (Strom

TTK protein kinase
NIMA (never in mitosis gene a)-rela
kinesin-like protein 2
ESTs, Moderately similar to translat
Matrix metalloprotease 1 (interstitia
protein kinase C; iota
NCK adaptor protein 1
BMPR-Ib; bone morphogenetic pro
activin A receptor, type I
ATP-binding cassette; sub-family C
Homo sapiens cDNA FLJ12952 fis,
hypothetical protein FL
hypothetical protein P1



Hs.183001 ESTs 

Hs.161712 ESTs 

Hs.50915 kallikrein 5

Hs.272620 pregnancy specific beta-1-glycoprot

Hs.36602 I factor (complement)

Hs.26322 0

Hs.90572 PTK7 protein tyrosine kinase 7

Hs.193465 ESTs

Hs.184339 KIAA0175 gene product



PFAM domains 




pkinase,Activin_recp


30.0 


SS,,Peptidase_M10


25.5


pkinase 


18.7 


pkinase 


16.2 


11.6 


ABC_tran 


8.4 


SS,,Peptidase_M10


7.2


Ski_Sno,pkinase_C


6.1
5.2


pkinase,Activin_recp 


5.1 


Activin_recp,pkinase


4.6


TM,ABC_tran 




RhoGAP,FF,ras


3.9


C2,PI-PLC-Y,PI-PLC-X


3.8


ANF_receptor,pkinase


3.6


SS,,Peptidase_M10


3.6


EGF,fn3,pkinase


3.4


SS,TM,trypsin 




SS,sushi,trypsin,vwa,fn3,


3.2




3.2 


pkinase,Activin_recp


3.2


SS,trypsin


3.1


SS,Peptldase_Mia/g 


3.1 


ldl_recept_a,trypsin,SRCR


3.0 


pkinase 


2.9 


TM,pkinase,ig 


2.9 


pkinase,Activin_recp


2.9


TM,pkinase,KA1


2.8 





AL158037 




predicted exon 




2.7 


BE242144 


Hs.12013 


ATP-binding cassette, sub-family E


SH3,pkinase,ABC_tran


2.7 


AI538613 


Hs.135657 


TMPRSS3a mRNA for serine protea


trefoil,trypsin


2.6 


BE244226 


Hs.21094 


RAB18, member RAS oncogene fam




2.6 


AA129547 


Hs.285754 


met proto-oncogene (hepatocyte gro


pkinase,Sema


2.5 


AL036710


Hs.209527 


ESTs 


CNH,pkinase


2.5 


X00442 


Hs.75990 


haptoglobin 


sushi,trypsin


2.5 


H56037


Hs.108146 


ESTs 


RhoGAP 


2.4 




Hs.76913 


proteasome (prosome, macropain) su 


proteasome 


2.4 


AA307211 


Hs.251531 


proteasome (prosome, macropain) su


proteasome


2.4


NM_002914 




replication factor C (activator 1) 2 (4


AAA,Viral_helicase1


2.2 






matrix metalloproteinase 9 (gelatina 


SS,fn2,,Peptidase_M10


11 


AA775987 


Hs.79357 


proteasome (prosome, macropain) 26




12 


NM_014735 


Hs.82292 


KIAA0215 gene product


PHD 


2.1 




Hs.155585 


receptor tyrosine kinase-like orphan


pro_isomerase


2.1 


AW450303 


Hs.2534 


bone morphogenetic protein recepto


Activin_recp,pkinase 




AA908197 


Hs.108850 


KIAA0936 protein 


TPR,pkinase 




AF053651 


Hs.31584 


metaxin 2 


pro_isomerase




AF071202 


Hs.139336 


ATP-binding cassette; sub-family C 


TM,ABC_tran




AA306342 




protein kinase C-like




2.1


AA295547 


Hs.62655




p450 


2.1 




Hs.95577 


cyclin-dependent kinase 4 


pkinase,ank,ArfGap,ras




H69894 


Hs.193041 


ESTs 


PI3Ka,PI3_PI4_kinase




BE244537 


Hs.167382 


natriuretic peptide receptor A/guany


TM,ANF_receptor,pkinase


2.0 


AA410g43 


Hs.72472 


BMPR-Ib; bone morphogenetic pro


TM,pkinase,Activin_recp


2.0


NM_006799 


HS.7202S 


protease, serine, 21 (testisin)


SS,trypsin


2.0


BE617527 


Hs.180450 


ribosomal protein S24


PI3Ka,PI3_PI4_kinase


2.0 


D42063 


Hs.179825 


RAN binding protein 2-like 1


TPR,pro_isomerase


2.0 


AF042385 


Hs^l 


peptidylprolyl isomerase E (cyclophi




2.0



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons



9212516 .Plus 



5. These were selected as for Table 10A, ex 



TABLE 13A; Abo 
Pkey: Primekey
Ex. Accn: Exemplar Accession
UG ID: UniGene ID
Title: UniGene title



-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL OVARY 



446441 
428758 
441406 
441359 
448406 

418068 



41730B 
436876 
439180 
428289 
405484 



Ex. Accn 
AW872527 
AU076643 



AW992356 
BE397753 
AF017307 



R34657 

AW296454 

AK001782 



UGID 

Hs.59761 

Hs.313 

Hs.288966 

Hs.17409 



AW971155 



Hs.76560 
Hs.293902 
L22524 HS.22S6 
NkL004415 Hs.74316 
AI860651 Hs.26685 
AW837046 Hs.6527 
AW90383B Hs.B1800 
AA687538 HS.3B972 
N41720 Hs.172684 
H60720 Hs.81892 
AI124756 Hs.5337 
AI393742 Hs.199067 



Title 
ESTs 

secreted phosphoprotein 1 (osteopontin, bone
ceruloplasmin (ferroxidase)
cysteine-rich protein 1 (intestinal)
epididymis-specific; whey-acidic protein type
secretory leukocyte protease inhibitor (antil
ESTs

interferon, gamma-inducible protein 30
E74-like factor 3 (ets domain transcription f
cysteine-rich protein 2
lipocalin 2 (oncogene 24p3)
uncoupling protein 2 (mitochondrial, proton c
hypothetical protein FLJ20171
hypothetical protein

Homo sapiens cDNA FLJ14303 fis, clone PLACE20
Homo sapiens cDNA FLJ10457 fis, clone NT2RP10
ESTs, Weakly similar to FIG1 MOUSE FIG-1 PROT
Homo sapiens mRNA; cDNA DKFZp564B076
Homo sapiens mRNA; cDNA DKFZp564B1264 
ESTs, Weakly similar to prolyl 4-hydroxylase 
7(matrilysin,uteri 



desmoplakin (DPI, DPII)






400289 
451035 

44Q84B 
436278



428411 
422166 
412477 
417130 
424673 



443162 
413719 
424687 



418526 
415511 
409453 
445637 
442432 
408243 
419092 
444172 
412115 
420440 
414386 



413278 
418506 
445919 
416854 
414186 
434371 
421937 
449722 



D49441 Hs.155981 
AA368546 Hs.8904 



X07820 Hs.2258 

AU076785 Hs.430 

BE314650 Hs.7476 

BE396290 Hs.5097 

AF1 13676 HS.75B21 

AW468397 Hs.100000 

AW291464 Hs.lOSSS 

W72424 Hs.1 12405 

AA150864 Hs.790 

AW276868 Hs.81256 

AA345051 Hs.294092 

U62801 Hs.79361 

T49951 Hs.g029 

BE439580 Hs.75498 

J05070 Hs.151738 

AL035737 Hs.75184 

AJ224172 Hs.204096 






AI732617 
AI885516 
AJ245671 

BE093589 Hs.38178 

Y00787 Hs.624 

J0SS81 Hs.89603 

BE147740 HS.104S58 

AK001763 Hs.73239 

NM_0O24O7 Hs.97644 

X00442 Hs.75990 

AA852604 Hs.125359 

H13032 Hs.103378 

BE563085 Hs.833 

AA084248 Hs,86339 

T53519 Hs.290357 

H40164 Hs.80296 

U33446 Hs.76799 
AA631352 

AI878857 Hs.109705 



424126 

408380 
419329 
409231 

413640
440943 

410132 

412719



X57522 Hs.168164 

AL033527 Hs.92137 

AL039104 Hs.159557 

AK001330 Hs.48855 

AW075486 Hs.286049 

M79141 Hs.13234 

NM_001982 Hs.199067 

AF178532 Hs.271411 

AF1118S6 Hs.10503g 

M8684g Hs.5566 

NM.014038 HS.S216 

AA161224 Hs.8372 

AA343629 Hs.104570 

At244661 Hs.103296 

L10343 Hs.1 12341 

AV663729 Hs.8185 

W79283 Hs.35962 

AI538613 Hs.135657 

M31669 Hs.1736 

A1791809 Hs.32949 



D13666 Hs.136348 
AI301558 Hs.290801 
AW082298 Hs.146161 



Ig superfamily protein
BCL2-like 1

INTERFERON-GAMMA INDUCED PROTEIN



Hs.182362 ESTs
HS.9S612 ESTs 



ATPase, H+ transporting, lysosomal (vacuolar
synaptogyrin 2

serine (or cysteine) proteinase inhibitor, cl
S100 calcium-binding protein A8 (calgranulin



S100 calcium-binding protein A9 (calgranulin 
microsomal glutathione S-transferase 1
S100 calcium-binding protein A4 (calcium prot
ESTs 

kallikrein 6 (neurosin, zyme) 
ESTs; Highly similar to KERATIN; TYPE I CYTO
small inducible cytokine subfamily A (Cys-Cys
matrix metalloproteinase 9 (gelatinase B, 92k
chitinase 3-like 1 (cartilage glycoprotein-39
lipophilin B (uteroglobin family member), pro
solute carrier family 16 (monocarboxylic acid



EGF-like-domain; multiple 6 

Homo sapiens cDNA: FLJ23468 fis, clone HSI11S

interleukin 8



hypothetical protein FLJ10901
mammaglobin 2

haptoglobin

Thy-1 cell surface antigen

ESTs, Weakly similar to DRR1 [H.sapiens]

Interferon-stimulated protein, 15 kDa

G protein-coupled receptor 39 

ESTs 

Purkinje cell protein 4 



ATP-binding cassette, sub-family B (MDR/TAP),
v-myc avian myelocytomatosis viral oncogene h
karyopherin alpha 2 (RAG cohort 1, importin a
hypothetical protein FLJ10468
phosphoserine aminotransferase
ESTs

v-erb-b2 avian erythroblastic leukemia viral
beta-site APP-cleaving enzyme 2
solute carrier family 34 (sodium phosphate),
Homo sapiens connexin 26 (GJB2) mRNA, complet
HSPC028 protein

ubiquinol-cytochrome c reductase (6.4kD) subu



X54942 Hs. 

AW016610 Hs.129911 

BE548267 Hs.50724 

A1027543 Hs.120912 

AW972642 Hs293055 

AI049783 Hs.241284 

AW247252 Hs.75514 

AI333660 Hs.17558 

AA173992 Hs.7956 

U65011 Hs.30743 

Y00272 HS.1B4572 



TMPRSS3a mRNA for serine protease (ECHOS1) (T
inhibin, beta B (activin AB beta polypeptide)
defensin, beta 1
diubiquitin

S100-type calcium binding protein A14
GA733-2; epithelial glycoprotein (EGP) (KSA)
Homo sapiens mRNA for osteoblast specific fac
ESTs

ESTs, Weakly similar to KIAA0a59 protein [H.s
Homo sapiens cDNA: FLJ23241 fis, clone COL013
Hs.58882 Microfibril-associated glycoprotein-2
CDC28 protein kinase 2



Homo sapiens cDNA FLJ10934 fis, clone OVARC10



nucleoside phosphorylase






BE440142 Hs.2943 
AI791905 Hs.96649 
AV652402 Hs.155145 
U71207 Hs.29279 
NM_013230 Hs.286124 
AF027208 Hs.297332 
NM.004753 H5.17144 
Hs.10029 
Hs.122579 
AB033a91 Hs.24g36 



443958 BE24ie80 



AW952022 Hs.234i74 

NM.016929 Hs.283021 

AU076628 Hs,79187 

L20688 Hs,83656 

W25005 Hs.24395 

AW067903 Hs.82772 

NM_000869 Hs,2142 

BE018682 Hs.44343 

T97307 Hs.199067 

AI656166 Hs,7331 

W07162 Hs.160826 

BE54B555 Hs.118554 

W47225 Hs.126256 



AA323362 Hs.9667 



444725 
430250 

418174 



433662 
422576 
423271 
443716 



443672 
416889 
408474 
411825 
400881 
440694 
414586 
411925 
417869 
433447 
450858 
410619 
434094 
421924 
446859 
421451 
433929 
438930 



417715 
409361 
416984 
430125 
434078 



439413 AI598252 

■449034 AI624049 

420344 BE4e3721 

431243 U46455 

417S1S L24203 

4S1267 AI033894 

450101 AV649989 

419693 AA133749 

431103 M57399 
451110 
425295 



AW445167 
AA306160 
AW014588 



AA305599 

BE514514 

AI494299 

AA291377 

AI375499 

AW843633 

AW503976 

AW958544 

W29092 

AW969587 

NM_006982 

H38765 

U46418 

AW880709 



Hs.76606 
Hs.72925 
Hs.827g3 
Hs.3261 



424570 
417847 
449027 
424959 
433159 
411393 
434815 
427585 
445721 



AA082750 
W51215 
AI521558 
AJ271216 



Hs.81256 

Hs.10649 

Hs.1 12242 

Hs.7678 

Hs.86366 

Hs.54416 

Hs.80706 

Hs.233950 

Hs.283683 ' 

Hs.78146 

Hs.37810 

Hs.277523 

Hs.97101 

Hs.252189 

Hs.82237 

Hs.117865 

HS.2438S 

Hs.92323 

Hs.44 

Hs.301584 

Hs.75839 

Hs.42194 



Hs.150587 
Hs.59771 
Hs.45744 
Hs.179729 



sprauty(Di 
Homo sap) 



coxsackie virus and adenovirus receptor 
Rho GDP dissociation inhibitor (GDI) beta
yB(Cys-X-C 



v-erb-b2 avian erythroblastic leukemia viral



cyclin E1

liver-specific bHLH-Zip transcription factor
anterior gradient 2 (Xenopus laevis) homolog
butyrobetaine (gamma), 2-oxoglutarate dioxy
mal, T-cell differentiation protein
Homo sapiens cDNA: FLJ23597 fis, clone LN
hypothetical protein FLJ20327



lymphocyte cytosolic protein 1 (L-plastin)
chromosome 11 open reading frame 13
proteasome (prosome, macropain) subunit, beta
neuronal pentraxin II

elongation of very long chain fatty acids (FE
keratin 18

hypothetical protein PRO2013

coronin, actin-binding protein, 1A

COX17 (yeast) homolog, cytochrome c oxidase a



cellular retinoic acid-binding protein 1



platelet/endothelial cell adhesion molecule (
ESTs 

gb:ts41a09.x1 NCI_CGAP_Ut1 Homo sapiens cDNA
Putative G protein-coupled receptor GPCi^SO
syndecan 4 (amphiglycan, ryudocan)
ataxia-telangiectasia group D-associated prot
solute carrier family 17 (anion/sugar transpo
Human hbc647 mRNA sequence
FXYD domain-containing ion transpo
pleiotrophin (heparin binding growth factor 8
ESTs

zinc finger protein 6 (CMPX1)
hypothetical protein FLJ22649 similar to sign
epithelial V-like antigen 1

Homo sapiens cDNA: FLJ22316 fis, clone HRC0S2

dipeptidylpeptidase III



kinesin-like protein 2



Core1 UDP-galactose:N-acetylgala
collagen, type X, alpha 1 (Schmid r
HSPCIBOprc ' 

ESTs, Weakly similar to A4P_HUMAN INTESTINAL
CDP-diacylglycerol synthase (pi




DKFZP586D0823 protein, Prostate androgen-regu
. 0 

434042
419080 
406545 
447362 
429547 



418917 
435777 
431049 
426427 



Al 589941 
AW150835 
AB018249 
AW176120 
AW009166 



Hs.8254 

Hs.18878 

Hs.10458 

Hs.9061 

Hs.99376 

Hs.247551 

Hs.124776 

Hs.2265 



hypothetical protein PRO0899 
' Heal protein aJ21620 
Juclble cytokine sut)fatnll) 



Homo sapiens mRNA; cDNA DKFZp564N1116 (from c



421340 
413966 
448243 



AW967069 
L43821 
AA284679 
AA852773 



Hs,24305 ESTs 
Hs.211556 
Hs.80261 
Hs.25640 
Hs.297939 



ESTs, Weakly similar to LAK-4p [H.sapiens]



AF013758 
403399 

435793 AB037734 
432629 AW360548 



polyadenylate binding protein-interacting pr



437575 
401131 
407207 
444783 
426230 



421305 
411704 
417018 
432827 
410174 
425184 
452322 
447526 

424867 
410275 
429083 
410173 



AK001468 
AA367019 
AA256641 



AL048753 

BE617695 

AI024860 

U85668 

Y09397 

AA706017 

M36135 



Hs.280658 ESTs
Hs.22941 ESTs 
Hs.36529 ESTs 
0 

Hs.179661 tubulin, beta polypeptide



Hs.236894 

HS.S0081 

HS.2B9721 

Hs.71573 

Hs.80887 

Hs.3109 

Hs.59461 

Hs.155048 

Hs.28988 

Hs.340 

Hs.286192 

Hs.153591 

Hs.61796 

Hs.227817 



protease, serine, 1 (trypsin 1)

ESTs; Highly similar to LOW-DENSITY LIPOPROTE

KIAA1199 protein 

diptheria toxin resistance protein required f 
hypothetical protein FUia074 
v-yes-1 Yamaguchi sarcoma viral related oncog
Rho GTPase activating protein 4
DKFZP434C245 protein
Lutheran blood group (Auberger b antigen incl
glutaredoxin (thioltransferase)



protein phosphatase 1, 
Not56(D. ■ ^ ■ . 
transcription factor AP-2 1 
BCL2-related protein A1



419038 AI533323 
403331 

409162 H25S30 

426150 NM.003658 

449292 AI990292 



AA379597 Hs.5199 



446608 N75217 

425222 M85430 

428309 M9781S 

420005 AW271106 

4369B2 AB018305 

407142 AM12S3S 



421242 
457715 
451668 
437142 
418588 
433068 
419BS4 
444726 
423011 
451428 



AA084460 

X02994 Hs.1217 

AW419202 Hs.286192 

AA846576 Hs.103267 

M86599 Hs.169840 

AW411194 Hs.120051 

AA355752 Hs.155965 

AW161386 Hs.13661 

AA642402 Hs.59142 

Z43948 Hs.26789 

A1791617 Hs.146068 

BE387040 Hs.182476 

Nm.006456 Hs.288215 

AW564873 Hs.87836 

NM.006147 H5.11801 

NM_000683 Hs.299847 

AVW)833B4 Hs.11067 

AF011333 H5.153563 

AW451197 Hs.1 13418 

AI973016 Hs.15725 



methionine-tRNA synthetase 



hypothetical protein FLJ10783

HSPC150 protein sin 

Homo sapiens clone 24747 mRNA sequence 



Hs.133294 

Hs.5378 

Hs.55235 



Hs.248 
Hs.301280 
Hs.1 2409 



spondin 1, (f-spondin) ext

sphingomyelin phosphodiesterase 2, neutral me 
TCF3 (E2A) fusion partner (in childhood Leuke 



protein phosphatase 1, regulatory (inhibitor)

hypothetical protein FU22S48 similar to gene 

TTK protein kinase 

ESTs 

ESTs 

ESTs, Weakly similar to dJ37E16.5 [H.sapiens]

ESTs 

ASPIC (acidic secreted protein In cartilage)A 
ESTs 

ESTs, Weakly similar to similar to alpha/beta



Homo sapiens PAC clone RP5-1087M19 from 7q11.

interferon regulatory factor 6

ESTs, Highly similar to A2AD_HUMAN ALPHA-2C-2

ESTs, Weakly similar to K02Eia2 [C.elegans]

lymphocyte antigen 75 

ESTs 

ESTs; hypothetical protein 8BBI48
424886 A1333771



441784 
418758 
408621 



421750 
405224 
447630 
407663 
427490 
414812 
427691 
420650 
439841 
425810 



AA431367 Hs.234546 

AB020713 Hs.56966 

AI522132 H5.28700 

AW959311 Hs.87019 

A1970672 Hs.46638 

AW182614 Hs.128499 

X73424 Hs.63788 
AW748920 

L32137 Hs.1584 

AA626937 HS.1B1651 

AK00C768 Hs.107872 



NM_016429 Hs.37482 

Z95152 Hs.1 78695 

X72755 Hs.77367 

AW194426 Hs.20726 

AA4667C6 Hs.44581 

AF038961 Hs.6710 

A1923627 Hs.31903 

J04088 Hs.1 56346 

AW747800 Hs.55016 

NM_005756 Hs.1 84942 

BE391804 Hs.62661 

NM_000954 Hs.8272 

AW374954 Hs.71779 

AB037762 Hs.44268 

AI677897 Hs.76640 

AA486794 Hs.66915 ' 

AL1 17482 Hs.7978 

U376ig Hs.87539 

X69699 Hs.73149 



457706 
412723 
435774 
408753 
447783 
418085 
4S2472 
409112 



425812 
411742 
415076 



AA807814 

AW405434 

AW175997 

AA164687 

AW043637 

M22324 

AL1375g7 

AW974668 

AA648459 



Hs.76781 



Hs,61345 
Hs,70582 
Hs.98367 



cartilage oligome
ESTs

hypothetical protein FLJ20761
0 

lymphoid enhancer-binding factor 1 
C0PZ2 for nonclathrin coat protein zeta-COP 
mitogen-activated protein kinase 13
monokine induced by gamma interferon



ESTs 

topoisomerase (DNA) II alpha (170kD)

hypothetical protein FLJ21936

G protein-coupled receptor 64 

guanylate binding protein 1, interferon-induc 

prostaglandin D2 synthase (21 kD, brain) 

Homo sapiens DNA from chromosome 19, cosmid F



Hs.297889 
Hs.21766 
Hs.1239 



AA364128 
AW247593 
NM_000857 
AA236776 



430375 AW371048 



434725 
414683 
429500 



iexpressbn factor 2 
RGC32 protein

ESTs, Weakly similar to 16.7Kd protein [H.sap
DKFZP434C131 protein 
aldehyde dehydrogenase 8 
paired box gene 8 

ATP-binding cassette, sub-family D (ALD), mem
cyclin-dependent kinase inhibitor 2A (melanom
myosin VB 
RU2S 

ESTs, Moderately similar to AF144056 1 apopto 
hypothetical protein FLJ22252 similar to SRY-
small nuclear ribonucleoprotein polypeptide B
gb:QV0-BT0078-190899-C05-E02 BT0078 Homo sapi 



ESTs 
ESTs 

alanyl (membrane) aminopeptidase (am 



tumor suppressing subtransferable candidate 1 
SH3 domain binding glutamic acid-rich protein
NADH dehydrogenase (ubiquinone) 1 alpha subco



Hs.294142 
Hs.50649 
Hs.61384 
Hs.149827 
Hs,1 94657 
Hs.245633 
Hs.71819 
Hs.77890 
Hs.79078 
Hs.7337 
Hs.93753 
Hs.91579 
Hs.62275 



AA340864 Hs.278552 
AK000796 
S78296 
X78665 
AF290512 



ESTs, Weakly similar to SP49_HUMAN SPLICEOSOM
quinone oxidoreductase homolog
KIAA144S protein 
ESTs 

cadherin 1, type 1, E-cadherin (epithelial) 
ESTs 

eukaryotic translation initiation factor 4E b
guanylate cyclase 1, soluble, beta 3 
MAD2 (mitotic arrest deficient, yeast, homolo 
hypothetical protein FLJ10936
H4 histone family, member H 
Homo sa| ■ 
CGI-141



hypothetical protein

internexin neuronal intermediate filament pro
hexabrachion (tenascin C, cytotactin)

Homo sapiens rhotekin mRNA, partial cds 



Hs.105509 CTL2 gene



418196 AI745649 Hs.26549 



414617 AI33gS20 HS.20S24 



ESTs, Weakly similar to T00066 hypothetical p 
ESTs, Weakly similar to C43H8.1 [C.elegans]
Homo sapiens cDNA: FLJ21901 fis, clone HEP034
ESTs, Moderately similar to hexokinase I [H.s
protein tyrosine phosphatase type IVA, member 
ESTs 

lethal giant larvae (Drosophila) homolog 2
447155
40S699 
406893 



457205 
428188 
449845 
406429 



M22406 

BE247SS0 Hs^8S9 

NM.002497 Hs.153704 

AI9493S9 Hs.301837 

BE001S96 HS.8S266 

AI905780 Hs.198272 

M98447 Hs.22 

AW971183 Hs.60054 






ESTs, Weakly similar to AF251041 1 SGC32445 p

gb:Human intestinal mucin mRNA, partial cds,
growth factor receptor-bound protein 7 (GRB7)
NIMA (never in mitosis gene a)-related kinase



450043 
403121 
400214 
453252 
451734 
416855 
424474 



1494514 Hs.171380 
M_002220 Hs.2722 
A886699 Hs.24332 



R02436 Hs.215725 

NM_006176 Hs,26944 

AA188763 Hs.36793 

AA308883 Hs.148680 



1(Kpolypeplidespidennslt 



gb:ll0815.S€q.F Human fetal heart, Lambda ZAP 



A1687303 Hs.285529 ESTs 
AI023799 Hs.163242 
Hs.155956 



Homo sapiens cDNA: FLJ23188 fis, clone LNG120
calcyon; D1 dopamine receptor-interacting pro
Homo sapiens mRNA for KIAA1SS1 protein, parti



421674 
439999 
411351 



407894 AJ278313 



Hs.296355 neuronal PAS domain protein 2 



421877 
404780 
401192 
447519 



[)90041 
T10707 

AA115811 Hs.6838 

W02919 1^.283476 

NM.002885 Hs.75151 

AA355925 Hs.36232 

Hs.41143 phosphoinositide-specific phospholipas

AA316266 Hs.129349 ESTs 

A1888490 Hs.55902 ESTs 

AW250380 Hs.1 09059 mitochondrial ribosomal protein L12 



U46258 

AF121858 

H48299 

R89344 

AI136431 

AK001432 

AI420859 

BE614387 

AA531276 



Hs.23443 
Hs.12169 
Hs.26126 
Hs.14143 

Hs.153408
Hs.150557 
Hs.47378 



prostate differentiation factor; placental bo
Homo sapiens cDNA FLJ10570 fis, clone NT2RP20
basic transcription element binding protein 1
ESTs 



437044 AL03Sa64 
428237 AF1 76206 
440048 AA897461 
414922 000723 
X51416 



408716 Ai567839 Hs.151714 ESTs 



ESTs 

ESTs, highly similar to differentially expres 
killer cell lectin-like receptor F1 
ESTs, Weakly similar to envelope protein [H.s 
glycine cleavage system protein H (aminomethy
estrogen-related receptor alpha 



D61594 



407014 
429311 
431842 



Hs.64173 
Hs.17279 
Hs.49767 
AI87112a - Hs.231265 
AK000733 Hs.23900 
Z46223 Hs.176663 
NM 001956 Hs.1407 
AI669750 HS.1BB881 
NHL001809 Hs.1594 
U38268 

AF080157 Hs.198993 
NM 005764 Hs.271473 
Z25427 



AL1 37692 
AW167128 
AI085198 



420917 
428676 
403482 



417749 
416693 
428474 



AA8S8643 Hs.120461 

AW135716 Hs.117330 

M19684 Hs.1 84929 

AI271438 Hs.105022 

U09196 HS.82S20 

AI373204 Hs.79531 

AB023182 Hs.184523 

NM.000346 Hs.2316 

T06199 HS.237S06 

Z21666 Hs.75900 

N88802 HS.S422 



ESTs, Highly similar to VAB1_HUMAN VACUOLAR A

tyrosylprotein sulfotransferase 1

NADH dehydrogenase (ubiquinone) protein 

ESTs 

GTPase activating protein

Fc fragment of IgG, low affinity IIIb, recept

endothelin 2

ESTs 

centromere protein A (17kD)
gb:Human cytochrome b pseudogene, partial cds
conserved helix-loop-helix ubiquitous kinase
epithelial protein up-regulated in carcinoma,
gb:H.sapiens protein-serine/threonine kinase

Homo sapiens mRNA; cDNA DKFZp434P182 (from cl 

ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434B1120 (from c

ESTs 
ESTs 

serine (or cysteine) proteinase inhibitor, d 

Homo sapiens PAC clone RP4-701O16 from 7q33-q 
0 

polymerase (DNA-directed), delta 4

Homo sapiens TTF-I interacting peptide 20 mRN

KIAA0965 protein 

SRY (sex-determining region Y)-box 9 (campome 
heat shock cognate 40



glycoprotein MOB
453735
424345 
423575 



AK001380 

C18863 

H49546 



Hs.125073 
Hs.145479 
Hs.163443 
Hs.298964 



Homo sapiens cDNA FLJ10518 fis, clone NT2RP20






452101 
420505 
426125 
433336 
428977 
429785 
402424 
424971 
433037 
421670 
438598 
453370 
410561 
402287 
419741 
442047 
428582 
440006 
406851 
457316 
420463 



ABOOigU 
AA290712 

T60298 

AW967984 

X87241 

AF017986 

AK001404 

H82114 



AI123657 
AL1 57500 
AW105723 



429113 D2B235 



, 427414 
419839 
418738 
429414 



M417181 
AW361666 
AI911841 



444261 
407846 
425163 
402520 
429597 
430044 
429663 
427036 
444381 
432090 
406778 
404961 
452313 
452355 
429942 
403165 
442150 



Homo sapiens HSPC296 mRNA, partial cds 
gb7b87f12.r1 Stratagene liver (937224) Homo

ESTs

FAT tumor suppressor (Drosophila) homolog
ESTs; Highly similar to FRIZZLED PROTEIN PRE



tumor suppressing subtransferable candidate 3
HSPC067 protein

BRCA1 associated protein-1 (ubiquitin carboxy
Homo sapiens cDNA: FLJ23439 fis, clone HSI0C1
ESTs, Moderately similar to translation initi
Homo sapiens cDNA: FLJ22044 fis, clone HEP091

0 

ubiquitin carrier protein E2-C

ESTs 

BENE protein 

hypothetical protein FLJ20510 

major histocompatibility complex, class II, D

ESTs 

Homo sapiens mRNA; cDNA DKFZp434G01S (from cl 



Hs.49500 
Hs.5184 
Hs,82767 



AA298958 
AA426202 
D10040 

NM_003816 
AA464510 
M68874 



Hs.196384 prostaglandin-endoperoxide synthase 2 (COX-2)
Hs.24724 ■•— - ■ 
Hs.153716 
Hs.6647 
Hs.93304 



> sapiens mRNA for Hmob33 protein, 3' untr 
Homo sapiens cDNA FLJ13088 fis, clone NT2RP30
phospholipase A2, group VII (platelet-activat
solute carrier family 7, member 11
empty spiracles (Drosophila) homolog 2
Homo sapiens cDNA FLJ13945 fis, clone Y79AA10
KIAA0746 protein 
TH1 drosophila homolog 
sperm specific antigen 2 
0 

gb;yra96a06.r 



Hs.1 08923 
Hs. 10724 
Hs.40403 



Hs.2442 
Hs.1 52812 
Hs.211587 
Hs.1 63913 
Hs.283713 
AW972855 Hs.292853 
H06273 Hs.101651 
AWg72195 Hs.284236 
YC0486 Hs.28914 
NS4926 Hs.2g202 
AI338993 Hs.134535 

AI368158 H3.128864 

AW401433 Hs.6649 

AC004923 Hs.135187 

AW023482 Hs.97849 

AI638164 Hs.225520 

BE407132 Hs.1 11286 

AA361562 Hs.178761 



MDS023 protein 

Cbp/p300-interacting transactivator, with Glu 
fatty-acid-Coenzyme A ligase, long-chain 2



adisintegiinandm* 
EST cluster (not in UniGene)
Human phosphatidylcholine 2-acylhydrolase (cP



G protein-coupled receptor 34



418601 AA279490 



Hs.130425 ESTs



AB007860 
AM56483 
AI458165 



Homo sapiens cDNA FLJ13479 fis, clone PLACE10



416661 AA634543
410001
430315 
422759 



419034 
448361 
412754 



417605 
445861 
4473S0 



N73885 Hs.124169 

BE258532 Hs.251871 

M339526 Hs.279593 

AI279652 Hs.132879 

AB041036 Hs.57771 

NIO04293 Hs.239147 

M938905 Hs.289112 CGI-43 protein 



X91809 

M31732 
U81961 
AB020676 
AF112213 
AA643687 
AW451676 
AA434054 



regulator of G-protein signalling 19
gb:IL3-CTO220-15020M70-B02 CT0220 Homo sapi

B-cell CLL/lymphoma 3

sodium channel, nonvoltage-gated 1 alpha



Hs.1 53564 
Hs.30524 
Hs.1 59448 
NM_001851 Hs.1 54350 
U17760 Hs.301103 
NM_002110 Hs.89555 



Hs.238707 
Hs.74565 

AI798863 Hs.87191 

AW138839 Hs.24210 

AF054910 Hs.1 271 11 

BE336857 Hs.1 18787 
AF086467 

AF006609 HS.B2294 

BE2g3423 Hs.11809 

AI375572 Hs.172634 

451807 W52854 Hs.27099 

421515 Y11339 Hs.105352 

422443 NHA.014707 Hs.116753 

412504 Z44496 Hs.26039 

453344 BE349075 Hs.44571 
402885 

438712 AW978161 Hs.169877 

421774 AL050374 Hs.108169 

425638 NIUL012337 Hs.1 58450 



AW629485 

AW204145 

AI199738 

AA447492 

N34895 

BE292842 

D55638 



442986 
42C317 
443211 
434361 
423493 
414183 
447778 
435106 
439490 
409606 
421308 
414960 
416783 

422605 
430427 
424620 
421693 
407727 
427706 



447843 
446576 
449700 
436476 



432673 
414684 
447210 



AA100847 

AW249197 

AW444594 

AA687322 

CI 5407 

AA206186 

AL120168 

H16646 

AA296701 

AA101043 

X71490 



AA326108 
AW0S84S9 
AKD01332 



ESTs 

Homo sapiens cDNA: FLJ23442 fis, clone HSI009
surfeit 2

collagen, type IX, alpha 1
laminin, beta 3 (nicein (125kD), kalinin (140
hemopoietic cell kinase 



ESTs 



icular) 



transforming growth fa
gb:Homo sapiens full length insert cDNA clone
regulator of G-protein signalling 3
single Ig IL-1R-related molecule
ESTs; HER4 (c-erb-B4)
DKFZP564J0863 protein
GalNAc alpha-2, 6-sialyltransferase I, long f
histone deacetylase 7B

Homo sapiens cDNA FLJ13937 fis, clone Y79AA10



DKFZP586C1619 protein 



Hs.293352 
Hs.1 56044 
Hs.208275 
HS.201B3 
Hs.44648 
Hs.1 661 20 



AI025990 Hs.285520 ESTs 
AB006628 Hs.96485 KIAA029 
AI128388 Hs.143655 ESTs 



interferon regulatory factor 7
gb:Human B-cell PABL (pseudo
CCR4-NOT transcription complex, subunit 3



Hs.117772 
Hs..12g683 
HsJ01711 
Hs,71190 
Hs.193380 
Hs.100043 
Hs.2387 
Hs.192843 

Hs.79889 
Hs.78919 
Hs.118666 
Hs.241413 
Hs.151254 
Hs.106876 
Hs.38044 
Hs.293800 
Hs.242463 

Hs.21694 
Hs.155651 
Hs.224891 
Hs.51820 
Hs.78358 
Hs.53631 
Hs.1 62246 
Hs.44672 
HS.27860& 
Hs.76893 
Hs.1 7752 



437395 AL365408 Hs.10632 



ubiquitin-conjugating enzyme E2D 1 (homologou



ESTs, Highly similar to AF174600 1 F-box prot
ESTs, Weakly similar to PSF_HUMAN PTB-ASSOCIA

transglutaminase 4 (prostate)
ESTs

gb:C15407 Clontech human aorta polyA+ mRNA (6
monocyte to macrophage differentiation-associ
Kell blood group precursor (McLeod phenotype)
Human clone 23759 mRNA, partial cds 



kallikrein 7 (chymotryptic; si 
ATPase, H+ transporting, lysosomal (vacuolar 
DKFZP564M082 protein 

ESTs, Weakly similar to ALU LHUMAN ALU SUBFAM 

0 

ESTs 

hepatocyte nuclear factor 3, beta 

ESTs 

ESTs, Moderately similar to ALU7_HUMAN ALU SU 



1FU10470 

ER-associated DNAJ; ER-associated Hsp40 co-ch
3-hydroxybutyrate dehydrogenase (heart, mitoc
phosphatidylserine-specific phospholipase A1 a
Fzr1 protein

hypothetical protein DKFZp762M136 







441627 


AA947552 


Hs.58085 


ESTs 


12.9 


419084 


AA496539 


Hs.179902


transporter-like protein 


12.9 


423067 


AA321355 


Hs.285401 


ESTs 


12.9 


42307C 


R55677 


Hs.1 55569 




\2S 


441344 


BE250144 


Hs.41514 




12.8 


423527 


AI20G965 


HS.10S861 


Homo sapiens cDNA FLJ13824 fis, clone THYRO10


123 


417006 


AWS73606 


Hs.80758 


aspar^-tRNA syntlietase 


m 


453552 


AL041941 


Hs.154729 


3-phosphoinositide dependent protein kinase-1


12.8 


453657 


W23237 


Hs.296162 


ESTs 


12.8 


434414 






gb:lr34t)07.x1 NCI_CGAP_Ov23 Homo sapiens cDNA,


12.7 


456051 


TB5626 


Hs.76239 


hypothetical protein FLJ20608


12.7 




BE379761 




ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAM


12.7 


418216 


AA662240 


Hs.283099


AF15q14 protein 


12.7 




AJ271684 


Hs.126355 


C-type (calcium dependent, carbohydrate-recog




424275 


AW673173 


Hs.144505 


DKFZP566F0546 protein 


12.7 


440062 


AI360518 


Hs.1 29692 


ESTs 




444371 


BE540274 


Hs.239


Forkhead box M1




412520 


AA442324 


Hs.795 


H2A histone family, member O




413349 


BE086692 




gb:QV1-BT0678-130400-156^07 BT0678 Homo sapi 


12.7 


414500 


W240a7 


Hs.76285 


DKFZP564B167 protein 


12.6 


429261 




Hs.143475 


ESTs 


12.6 


402238 
400280 










421246 


AW582962 


Hs.300961 


ESTs, Highly similar to AF151805 1 CGI47 pro


12.6 


442029 


AW956698 


Hs.14456 


neural precursor cell expressed, developmenta


12.6 




L13266 


Hs.105 


glutamate receptor, ionotropic, N-methyl D-as 




409964 


AW368226 


Hs,67928 






418793 


AW382987 


Hs.88474 


prostaglandin-endoperoxide synthase 1 (prosta


12.6 


452117 


AI421760 


Hs.77870 


Homo sapiens cDNA FLJ12750 fis, clone NT2RP20


12.5 


448074 


BE6213S5 


Hs.27160 


ESTs 


12.5 


44265S 


AW027457 


HsJ0323 


ESTs 


12.5 




AL137163 


Hs.57549 


hypothetical protein dJ473B4
0 


12.5 
12.5


400240 
413048 


M93221 


Hs.75182 


mannose receptor, C type 1 


12.5 


426215 


AW963419 


Hs.166223 




12.5 


430024 


AI808780 


Hs.227730 


integrin, alpha 6 


12.6 


445655 


AA873830 


Hs.167746 


B cell linker protein


12.5 


419941 


X98654 




phosphatidylinositol transfer protein, membra 


12.5 


425280 


U31519 


Hs.1872 


phosphoenolpyruvate carboxykinase 1 (soluble)


12.5 


427767 


Aia79283 


Hs.180714 


cytochrome c oxidase subunit VIa polypeptide


12.4 


450243 


AW1 19084 




ESTs 


12.4 


408930 


AA146721 


Hs.49005 


hypothetical protein 




418783 


T41368 




gb:ph1d1_19/1TV Outward Alu-primed hncDNA lib


12.4 


452096 


BE394g01 


Hs.226785 


ESTs 


12.4 


424513 


BE385864 


Hs.149894 


mitochondrial translational initiation factor


12.4


422306 


BE044325 


Hs.227280 


Homo sapiens mRNA for Lsm5 protein 




409031 


AA376836 


Hs.76728 


ESTs 


12.4


435515 


N40080 


Hs.6879


DC13 protein 


12.4 


429583 


NM_006412 


Hs.209119 


1-acylglycerol-3-phosphate O-acyltransferase


12.3 


449643 


R05989 


Hs.1 9603 


ESTs 


12.3 




AL050060 


Hs.7158 


DKFZP566H073 protein 


12.3


425593 


AA278921 


Hs.1908 


proteoglycan 1, secretory granule 


12.3


447357 


AI375922 


Hs.159367 


ESTs 


12.3


405089 






0 


12.3


414972 


BE263782 


Hs.77695 


KIAA0008 gene product


12.3


435039 


AW043921 


HS.130S26 


ESTs 


12.3


447033 


AI357412 


Hs,157601 


EST - not in UniGene






AW973352 






12.3


409377 


AA300274 


H&11S6S9 


Homo sapiens cDNA: FLJ23461 fis, clone HSI077


12.3


400116 






0 


12.3


445806 


AL137516 


Hs.13323 


hypothetical protein FLJ22059




457817 


AA247751 


Hs.79572 


cathepsin D (lysosomal aspartyl protease)




442410 


AW996503 


Hs.1 97680 




12.2 


445404 


AI261687 




ESTs, Weakly similar to JC4974 sodium iodide


12.2 


403372 


AW249152 


Hs.44017 


SIR2 (silent mating type information regulat


12.2 


427082 


AB037858 


Hs.1 73484 


hypothetical protein FLJ10337


12.2 




AVV753676 


Hs.39982 


ESTs 


12.2 


400268 
433190 


M26901 


Hs.3210 




12.2 


444863 


AW384082 


Hs.301323 


ESTs 


12.2 




AF153815 


Hs.50151 


potassium inwardly-rectifying channel, subfam


12.2 


451346 


NtL006338 


Hs.26312 


glioma amplified on chromosome 1 protein (leu 


12.2 


430262 


AA213780 


Hs.237323 




12.2 


421071 


AI311238 


Hs.104476 


ESTs 


12.2 




NM_015556 








409178 


BE393948 


Hs.50915


kallikrein 5


12.1
12.1


400250 
428460 


NM.014791 


Hs.184339 


KIAA0176 gene product 




414531 


T69387 


Hs.76364 


allograft inflammatory factor 1 


12.1


448210 




Hs.7393 


hypothetical protein from EUROIMAGE 1987170




440081 


AA863389 


Hs.135643 


ESTs 


12.1


413179 




Hs.75227 


NADH dehydrogenase (ubiquinone) 1 alpha subco


12.1 


447551 


BB£6&34 


Hs.929 


myosin, heavy polypeptide 7, cardiac muscle,




400517 


AF242388 


Hs.149585 




12.1
430637
415099 
44M22 




Homo sapiens cDNA: FLJ2113S fis, clone CA8072
ESTs 

gb:ht98t1 1 .x1 NCLCGAP_Ui24 Homo sapiens cDNA 






442556 
451002 
401879 



AL137761 Hs.8379 



436877 
434560 
448105 
400279 
440497 
451260 
429175 



420153 
428760 
421401 



AI267700 Hs.111128 
AW163045 Hs.79334 
AW373401 Hs.273599 
T24653 Hs.23360 
AL0499B5 Hs.19180 
Hs.1 21255 
Hs.3964 
Hs.170675 



nuclear factor, m 
ESTs 



AW591433 

AA887266 Hs.144979 
AW750773 

AI953040 Hs.127714 

NM_004454 Hs.43697 

AA436735 Hs.1 831 71 
N22120 



430423 AI190548 



ukin 3 regulated 



Hs.1 92398 ESTs 



likely ortholog of yeast ARV1 
Homo sapiens mRNA; cDNA DKFZp564E122 (from cl
ESTs, Weakly similar to cDNA EST EMBLD67419
Homo sapiens clone 24877 mRNA sequence
ESTs, Weakly similar to TMS^UMAN TRANSMEMBR



gb'.CM0-CN0044-26010O-164'h03 CN0044 Homo sai 
ESTs, Moderately similar to SOX30 protein [H.
ets variant gene 5 (ets-related molecule)
Homo sapiens cDNA: FLJ22002 fis, clone HEP066
hypothetical protein FLJ13910



transforming, acidic coiled-coil containing p

0 

ESTs, Weakly similar to hypothetical protein

0 

gb:oc44(08.s1 NCI_CGAP_GCB1 Homo sapiens cDNA

0

ESTs, Weakly similar to S59856 collagen alpha



461051 BE254309 
415323 BE269352 
412153 R87934 



457644 
422848 
424134 



Hs.97723 
Hs,2424S3 
Hs.144962 
Hs.121483 
Hs,140950 . 



K interacting, kinesin-like (rabkinesin6)

gb:ao86a08j(1 Schiller meningioma Homo sapien
DKFZP586G1624 protein
neutrophil cytosolic factor 2 (65kD, chronic
gb:yo47b10.r1 Soares adult brain N2b4HB55Y Ho






PCT/US02/19297 





451931 


AK000208 


Hs.27267 


Homo sapiens cDNA FLJ20201 fis, clone COLF121




400438 


AF185611 


Hs.115352 


growth hormone 1 




412994 


D32257 


Hs.75113 


general transcription factor IIIA




408124 


U89337 


Hs.42853 


cAMP responsive element binding protein-like




452249 


BE394412 


Hs.51252 


ESTs 




424627 
405626 


AA344555 




gb:EST50715 Gall bladder I Homo sapiens cDNA
0 




436690 


AA373970 


Hs.183096 


ESTs




415862 


R51034 


Hs.144513 


ESTs 


406755 


N80129 


Hs.94350 


metallothionein 1L






AI244368 


Hs.8124 


PH domain containing protein in retina 1 




429612 


AF062649 


Hs.252587 


pituitary tumor-transforming 1




423334 


AK000906 


Hs.127273 


hypothetical protein FLJ10044






BE301909 


Hs.279952 


glutathione S-transferase subunit 13 homolog


428423 


AU075517 


Hs.184275 


solute carrier family 9 (sodium/hydrogen exch 




442353 


BE379594 


Hs.49136 


ESTs 




447700 


AI420183


Hs.171077 


ESTs, Weakly similar to similar to serine/thr




402077






0 




409203 


AA780473 


Hs.687 


cytochrome P450, subfamily IVB, polypeptide 1


405146 












AI126772




ESTs 




425508 


AA991551 


Hs.97013


ESTs 




428340 


AF261038 


Hs.154721 


aconitase 1, soluble




431452 


AI073641 


Hs.162372 


ESTs 


446651 


AA393907 


Hs.97179 


ESTs 

tachykinin 3 (neuromedin K, neurokinin beta) 




443755 


C18397 


Hs.9730 




436209 


AW850417


Hs.254020 


ESTs, Moderately similar to unnamed protein p




401020 






0 




456724 


AW247388 


HSJ01423 


calcium binding protein 1 (calbrain)


407227 


Hg4949 


Hs.171955 


trophinin associated protein (tastin) 




402066 
442721 


AI015892 


Hs.101282 


0 

Homo sapiens mRNA; cDNA DKFZp434B102 (from cl




401025 






0 




452423 


AA991724


Hs.180535


Homo sapiens cDNA: FLJ22711 fis, clone HSI133


431685 


AW296135 


Hs.267659 


vav 3 oncogene 




425176 




HS.30143Q 


ESTs, Moderately similar to TEF1_HUMAN TRANSC 






AW840171 


Hs.265398






409079 


W87707 


Hs.82065


Interleukin 6 signal transducer (gp130; oncos




466995 


T89832 


Hs.170278


ESTs 


419223 


X60111 


Hs.1244 


CD9 antigen (p24) 






BE514982 


Hs.38991 


S100 calcium-binding protein A2






AW191962 


Hs.288061 


actin, beta 




437929 


T09353 


Hs.106642


ESTs, Weakly similar to hypothetical protein




415789 


H01581 




gb:yj33IOar1 Soares placenta Nb2HP Homo sapi 


424147 


AL137376 




Homo sapiens mRNA; cDNA DKFZp434J0226 (from c




436034 
404931 


AF282693 


Hs.150185 


Inflammation-related G protein-coupled recept

0 




445979 


AI695047 


Hs.202395 










Hs^6040 


ESTs; Highly similar to CYTOCHROME P45 IVA2 


433133 


AB027249


Hs.104741 


PDZ-binding kinase; T-cell originated protein 




445258 


AI635931 


Hs.147613 


ESTs 




417251 


AW015242 


Hs.99488 


ESTs; Weakly similar to ORF YKR074w [S.cerevi




421041 


N36914 


Hs.14691 


ESTs 




425537 


AB007913 


Hs.158291 


KIAA0444 protein 


435763 


AI243929 


Hs.190419 


ESTs 




444790


AB030506 


HS.119SS 


BO protein 




453857 


AL080235 


Hs.35861


DKFZP586E1621 protein




433882 


U90441 


Hs.3622 


procollagen-proline, 2-oxoglutarate 4-dioxyge




405358 
435814 


AW615179 


Hs.152370


ESTs 




422809 


AK001379 


Hs.121028


hypothetical protein FLJ10549




446772 


AW294404 




Homo sapiens cDNA FLJ11672 fis, clone HEMBA10




456694 


AW016382 


Hs.105642 


Homo sapiens cDNA: FLJ23271 fis, clone HEP001




441128 


AA570256 


Hs.54628 


ESTs 




NM_004482


Hs.278611 


UDP-N-acetyl-alpha-D-galactosamine:polypeptid




412576 


AA447718 


Hs.107057 


ESTs 




411122 


F00809 


Hs.143696


coactivator-associated arginine methyltransfe






AA432391 








426260 


NM_002541 


Hs.168669 


oxoglutarate dehydrogenase (lipoamide) 






Hs.11538 


actin related protein 2/3 complex, subunit 1A




431947 


AL359613 




hypothetical protein DKFZp762D1011 






BE378174 


Hs.26506 


Homo sapiens clone COABP0005 mRNA sequence 




458627 


AW088642 


Hs.97984 


ESTs; Weakly similar to WASP-family protein [




409142 


AL136877


Hs.50758 


chromosome-associated polypeptide C


447627 


AF090922 


Hs.285902


CGI-113 protein 




447656 


NM_003726 


Hs.19126


src kinase-associated phosphoprotein of 55 kD




454227 


AW963897 


Hs.44743 


KIAA1435 protein 
0 




402927 
422380 


AA309881


Hs.136246 


ESTs 


455986 


BE177736 




gb:RC1-HT0598-140300-021-g06 HT0598 Homo sapi




410962 


BE273749 


Hs.752 


FK506-binding protein 1A (12kD)




450361 


BE32710B 


Hs.202512 


ESTs 




457484 


H57645 




gb:yr21e01x1 Soares fetal liver spleen 1NFLS 






AI287341 


Hs.154029


bHLH factor Hes4



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ESTs, WeaMy similar to PLK.HUMAN PROTEOaYCA 




439190 
408975 
415130 






Hs.159408 
Hs.112554
Hs.294151 
Hs^1969 
HS741517 
Hs.62713 



438982 
446224 
405108 
438233 
401799 
454038 
414222 
421828 



BE613126 
N30714 
AW979101 
AW450551 



Hs.11 
Hs.47783 
Hs.20161 
Hs.291980 
Hs.13308



416218 
457848 
442577 
406506 



AA292998 
AF015272 
AA376768 



Homo sapiens clone 24420 mRNA sequence
ESTs 

ESTs, Moderately similar to KIAA0544 protein
ESTs 

polymerase (DNA directed), theta
ESTs 

DKFZP5640123 protein 

ESTs, Weakly similar to T12540 hypothetical p 

HDCME31P protein 



AA344932 Hs.118786 metallothionein 2A



Hs.23213 
Hs.125682
Hs.163900
Hs.115418
Hs.288977 
Hs.198272 
HS.19S54 



ESTs; Weakly similar to D2092.2 [C.elegans]



Homo sapiens cDNA: FLJ22622 fis, clone HSI056
NADH dehydrogenase (ubiquin
chromosome 1 open reading frame 2



431986 
423145 
414402 
417079 
426095 
434577 
442416 
421506 
435084 
431724 
456798 
417370 



D17516 
AA614535 
AJ006422 



433192 AB040880 



Hs.81134 

Hs.89986
Hs.179769 
Hs.129550 
HS.105C97 

HS.2837C4 
Hs.135183 
HS.8203G 
Hs.118633
IIS.2701C6 



Hs.93 



401782 
453927
450737 
421633 
409881 
432883 
440099 
419024 
401836 
408896 
443120 



BE466e39 Hs.61779 

AL050062 Hs.19999 

NM_012434 Hs.117865

AA082465 Hs.301751



ESTs, Weakly similar to secretory carrier mem
gb:601172959F1 NIH_MGC_17 Homo sapiens cDNA c
interleukin 1 receptor antagonist
ESTs 

Homo sapiens cDNA: FU224a7 fis, clone HRC109 



centaurin-alpha
tryptophanyl-tRNA synthetase
2'-5'oligoadenylate synthetase-like

ESTs 

ESTs, Moderately similar to KIAA1447 protein 
Homer, neuronal immediate early gene, 2 
sema domain, immunoglobulin domain (Ig), tran
Homo sapiens cDNA FLJ13591 fis, clone PLACE10



AA333990 Hs.80424 



421154 
414271 
400812 



Hs.728 

Hs.69954 

Hs.238730 

Hs.287631 

Hs.75871 



Hs.106260 
Hs.202830 
Hs.3112 
Hs.6909 
Hs.1219 



coagulation factor XIII, A1 polypeptide 



Hs.153436 N-acetyltransferase, homolog of S. cerevisiae
Hs.12400 ESTs

Hs.162189 serine/threonine kinase with Dbl- and pleckst 



421905 AI660247 



oxidised low density lipoprotein (lectin-like 
ribonuclease, RNase A family, 2 (liver, eosin
laminin, gamma 3



death associated protein 3 

Homo sapiens cDNA FLJ13937 fis, clone Y79AA10

KIAA0203 gene product 

MAD (mothers against decapentaplegic, Drosoph
ESTs 

Homo sapiens cDNA: FU2312S Us, clone U4G082 
v-myc avian myelocytomatosis viral related on
nel (chicken)-like 2

ESTs, Weakly similar to LIV-1 protein [H.sapi 







412116 
413808 
458572 
403295 
403910 
453400 
406502 
404743 
412517 



NM_006799 Hs.72026

AW40216e Hs.784 

J00287 Hs.182183

AI223423 Hs.292794 



ESTs, Weakly similar to AF151840 1 CGI-82 pro

KIAA1275 protein
0 

ESTs, Weakly similar to AF220049 1 uncharacte
ESTs
ESTs 

steroid receptor RNA activator 1 (complexes w
gb:oz05g05.x1 Soares_fetal_liver_spleen_1NFLS
ESTs, Weakly similar to pMOmDia^musculus]
B-cell CLL/lymphoma 7B
hypothetical protein 
0 

ESTs, Weakly similar to unnamed protein produ 
phosphoinositide-3-kinase, catalytic, beta po 

sapiens cDNA FLJ13495 fis, clone PLACE10
Homo sapiens mRNA; cDNA DKFZp564C2163 (from c 
conserved gene amplified in osteosarcoma

gb:PM0-HT0335-180400-008-c08 HT0335 Homo sapi 
0 

mutS (E. coli) homolog 2 (colon cancer, nonpo
protease, serine, 21 (testisin)

'nduced gene 2 (lymphocyte 



Hs.825g0 ESTs, Moderately similar to ALU7_HUMAN ALU SU



gb:601141065F1 NIH_MGC_9 Homo sapiens cDNA c



S.I 24915 



425734 AF056209 Hs.159396 

419280 W07606 Hs.283725 

443503 AV645438 

423165 Ara37547 

450206 AI796450 Hs.201600 ESTs 

459052 AA298ai2 H5.98539 

456248 AL035786 Hs.82425 

428438 NM_001955 Hs.2271

456525 AW468397 Hs.100000 

426127 L36983 Hs.167013 

TABLE 13B: 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



ins cDNA FLJ12627 fis, clone NT2RM40



actin related protein 2/3 complex, subunit 5
Endothelin 1

S100 calcium-binding protein A8 (calgranulin



412153 
412241 
412517 
412801 
413349 



414950 1509777_1
415789 1555357_1
416368 1591066_1
418464 1759038_2
418783 1789791_1
424627 241724_1
425827 256834_1
434371 384839_1
434414 38585_1
434732 392447_1
439092 468554_1
439636 47467_1
447824 73861_1
450484 83645_1
451260 863912_1
451852 888359_1
452101 898742_1



BE269057 BE513434 BE396654 



C15407 D81769 D61133 
H01581 H12860 R65905 H13053
R88849 R84573 H50890
R87560 

T41368 T41369 T41294
AA344555 AA344312 AW963070
W28316 W26507 AA364334
AA631352 AA631438

AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW358362 AA232351 AA218567 AA055556 AW858231
AW857541 AW8U172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 AA227092 AA227080 T12379 AA092174
T61139 AA149776 AA599829 AW879188 AW813567 AW813538 AI267168 AA157718 AA157719 AA100472 AA100774 AA130756 AA157705
AA157730 AA157715 AA053524 AW849581 AW8545e6 C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824
A182930g AW99ig67 N66951 AA627374 H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662
AW817705 AW817703 AW817659 BE081531 H59570
AI078443 AA648102 AI765577 AW974381
AA830149 AW978407 M85983 AW503637 
AF086467 W81444 W81445 



T60298 AI858257 Te9667 T67634 T61224 T71537 T68933




454163 1048369_1
454393 115888_1
455102 1253524_1
455864 1377038_1
455986 1397521_1
456423 187241_1
457426 335189_1
457484 342113_1
457705 389383_1



AW176997 AW176000 AW175999 AW175994 AW176004 AW176989
BE153288 BE153151 BE152925 AA078302
BE005496 BE005494 AW856324 AWg00ig9
BE148970 BE14a975 BE148957 BE148937
BE177736 BE177735 BE177734
AW748g20 AA487506 AA248914 AA780494
AW971119 AA574265 AA513268
H57645 T19302 AA527038 Z24851 H93171
AW974668 AA661959 AA649572 AA640401 AA640402
AJ003294 AJ003315 AJ003293



TABLE 13C: 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7-digit numbers in this column are Genbank Identifier (GI) numbers.

human chromosome 22." Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons
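
The Nt_position entries in Table 13C are comma-separated start-end pairs, one pair per predicted exon. The following is a minimal sketch of how such a field might be read into numeric ranges. The function names are hypothetical and not part of the specification, and the sketch assumes that coordinates are inclusive and that an entry may list the larger coordinate first.

```python
def parse_nt_position(field):
    """Split an Nt_position field into (start, end) integer pairs.

    Assumption: the field is a comma-separated list of 'start-end' pairs.
    """
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (assumption)."""
    return sum(abs(end - start) + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("24726-24830,26791-27021")
    print(exons)
    print(total_exon_length(exons))
```

Entries that the extraction has garbled (for example a missing hyphen) would raise a `ValueError` here, which is one way to flag rows that need manual checking against the published table.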



Pkey 

400640 

400666 

400776 

400812 

400881 

400882 

400965 

401010 

401020 

401025 

401047 

401131 

401192 

401220 

401383 

401405 

401519 



401879 
401888 
401897 
402066 
402077 
402104 
402238 
402287 



402679 
402840 
402883 



402927 
402943 
402944 
402947 



403121 
403165 
403295 
403381



403649 
403910 
403912 
403914 
404182 
404502 



Ref Strand 
8117685 Plus 



2842777 Minus 

2842777 Minus 

7770576 Minus 

6117391 Minus 

8117458 Minus 

8117518 Minus 

6705887 Minus 

8699812 Minus 



7798785 Plus 

7705041 Minus 

6715702 Plus 

3262819 Plus 

7331447 Plus 

7139700 Plus 



8119072 
7690126 
4559317 
9885999 
9796239 
9796344 
9796344 



9926562 Plus 
9926751 Plus 



9581599 Minus 



9438267 Minus 



8705159 
7710710 
7710730 
7417588 
4775644 



Nt_position
144324-144429 
17982-18115,20297-20 
103576-103720 



91446-91603,92 
110431-110708 
173043-173564 
83967-84180 
59085-60227 

179287-179483,181044-181166,181844-182039 



94802-94987,95804-95887,96323-96487,97596-97826

69559-70101 

48079-48279 

155543-157381 



157315-157950 

124414-124950,125050-125418 

18921-19505 
96484-96681 

217235-217356,217521-217873 

147802-148251 

4588845535 

142257-142742 

101064-102827 



604-767 

135543-135031 

65014-65195

122409-122500 

24726-24830,26791-27021 

40811-42447 



64925-65073 

8615-9103 

171761-171996 



57118-57306 

38666-38803,38885-39019,39097-39231,39308-39445

71919-72049 

41251-41443 

47247-47396 



11-110715,111173-1116' 
!9-101991 
46855-46941,47032-47148
92097-92864 
40594258 



22386-22708 
26009-26178 
119837-121197 



61841-62145,62367-62756



2888-3001,3198-3532,3655-4117

27141-27247 

5761-6188 

72000-72290,72431-72700,72929-73199 

7431-8472 

18163-18444 



101320-101501 







115196-115448 

120556-120999 

175708-175871 

38611-38761 

4422^44382 



103182-103973 
136687-136768 
135020-135472 
37883-38052,38138-38332 



37640-37817 
93876-94275 

199214-199579,199672-199920,200262-200495 

46199-45647 



89276-89384,92450-92629,97091-97279,98546-98666 
100727-100859 
160502.1611W 
27743-28264 
122557-123551 
607903-608271 
83206-83 
63430-63 



Table 14A lists about 695 genes up-regulated in ovarian cancer compared to normal adult tissues. These were selected from 59680 probesets on the Affymetrix/Eos Hu03
GeneChip array such that the ratio of 'average' ovarian cancer to 'average' normal adult tissues was greater than or equal to 4.0. The 'average' ovarian cancer level was set to
the 90th percentile amongst various ovarian cancers. The 'average' normal adult tissue level was set to the 90th percentile amongst various non-malignant tissues. In order to
remove gene-specific background levels of non-specific hybridization, the 15th percentile value amongst various non-malignant tissues was subtracted from both the numerator
and the denominator before the ratio was evaluated.
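
The selection rule described above can be illustrated with a short sketch. This is not the software used to generate Table 14A. The function names, the linear-interpolation percentile, and the `floor` guard against a zero or negative denominator are all assumptions introduced here for illustration; the specification states only the percentiles, the background subtraction, and the 4.0 threshold.

```python
def percentile(values, pct):
    """Linear-interpolation percentile (pct in 0..100) of a non-empty list."""
    ordered = sorted(values)
    if len(ordered) == 1:
        return ordered[0]
    rank = (pct / 100.0) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


def upregulation_ratio(tumor, normal, floor=1.0):
    """Ratio of 'average' tumor to 'average' normal expression for one probeset.

    'Average' is the 90th percentile of each group. The 15th percentile of
    the normal samples is subtracted from both terms as background.
    The floor value is an assumption to keep the ratio defined.
    """
    background = percentile(normal, 15)
    numerator = max(percentile(tumor, 90) - background, floor)
    denominator = max(percentile(normal, 90) - background, floor)
    return numerator / denominator


def is_selected(tumor, normal, threshold=4.0):
    """Apply the 'greater than or equal to 4.0' cutoff stated in the text."""
    return upregulation_ratio(tumor, normal) >= threshold


if __name__ == "__main__":
    normal = [0, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
    tumor = [315, 315, 315, 315, 315]
    print(upregulation_ratio(tumor, normal), is_selected(tumor, normal))
```

Using a high percentile rather than a mean keeps a probeset in the list when it is strongly expressed in only a subset of tumors, while the same percentile on the normal side penalizes probesets that are high in any one normal tissue.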

TABLE 14A: ABOUT 695 UP-REGULATED GENES, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES
Pkey: Primekey
Ex. Accn: Exemplar Accession



Pkey Ex. Accn

452838 U65011 

438817 AI023799 

432938 T27013 

421478 AI683243 

415989 AI267700

418179 X51630 

449034 AI624049 

428579 NM_005756

428153 AW513143 

436982 AB018305 

427585 1^31152 

435094 AI560129 

430691 C14187 

430491 AL109791 

415511 AI732617

448243 AW369771 

428187 AI687303 

408081 AW451597 

413007 M13509 

400292 AA250737 

422956 BE545072 

413335 AI613318 

423739 AA39S155 

410929 H47233 

424086 AI351010 

424905 NM_002497 

427356 AW023482 

407153 R45175 

407633 AJ404572 

427459 AA403084 

438993 AA828995

428664 AK001666 

439820 AL360204 

421155 H87879 

426635 BE395109

431989 AW972870 

422805 AA436989 

444783 AK001468 



Hs.30743 
Hs.163242
Hs.3132 
Hs.97258 

Hs.1145

Hs.184942 
Hs.98367 
Hs.5378 

Hs.179729 
Hs.277523 
Hs.103538 



Hs.285529 
Hs.167409
Hs.83169
Hs.72472 
Hs.122579 
Hs.48442 
Hs.97600 
Hs.30643 
Hs.102267 
Hs.153704 
Hs.97849 



steroidogenic acute regulatory protein 



Wilms tumor 1

gb:ts41a09.x1 NCI_CGAP_Ut1 Homo sapiens cDNA clone 

G protein-coupled receptor 64 

hypothetical protein FLJ22252 similar to SRY-boxc 

spondin 1, (f-spondin) extracellular matrix protei

collagen; type X; alpha 1 (Schmid metaphyseal chon

EST 

ESTs 

Homo sapiens mRNA full length insert cDNA clone EU



Hs.102267 
Hs.129327 
Hs.291059 
Hs.121017 
Hs.62180 



NIMA (never in mitosis gene a)-related kinase 2
ESTs 

gb:yg40l01.s1 Soares infant brain 1NIB Homo sapien



integrin, beta 8
Hs.189095 similar to SALL1 (sal (Drosophila)-like
Homo sapiens mRNA full length insert cDNA clone EU
lysyl oxidase 



H2A histone family, member A






428876 AI037824

416209 AA236776 

408660 AA525775 

41C247 AF181721 

418738 AW388633 

459583 AI907673 

413623 AA825721 

439706 AW872527 

409041 AB033026 

451110 AI955040 

436775 AA731111 

443211 AI128388

445258 AI635931 

447350 AI375572 

428227 AA321649 

453392 U23752 



423811 AW299598 

452461 N78223 

451106 BE382701 

416208 AW291168 

452249 BE394412 

452055 AI377431

439243 AA593254

420149 AA255920

429125 AA446854

413597 AW302885

416566 NM_003914

442438 AA995998 

407710 AW022727 

416661 AA634543 

428392 H10233 

431725 X65724 

447700 AI420183 

458027 L49054 

408460 AA054726 

424735 U31875 

415263 AA948033 

400298 AA032279 

462096 BE394901 

421451 AA291377 

435496 AW840171 

443715 AI583187 

402606 #(NOCAT) 

436954 AA740151 

413472 BE242870 

410102 AW24B50e 

408562 AI436323

452030 AL137578 

442353 BE379594

427344 NM_000869

453160 AI263307 

426427 M86699 

449433 AI672096 

412723 AA648459 
400250 

419752 AA249573 

438167 R28363 

434539 AW748078 

429918 AW873986 

450375 AA009647 

400289 X07820

420900 AL045633 

428758 AA433988 

446142 AI754693 

421285 NM_000102 

433496 AF064254 

418506 AA084248 

433447 U29195 

424188 AW954552 

414245 BE148072 

426462 U59111 

418601 AA279490 

444170 AW513879 

453818 NM_003462

407378 AA299264 

440901 AA909358 

407366 AF026942 

415227 AW821113

409269 AA576953 

450480 X82125 



Hs.150917 
Hs.109057 
HS.2821B4 
Hs.194695 
HS.7907B 



Hs.59761 
Hs.50081 
Hs.301584 
Hs.291891 
Hs.143655 
Hs.147613 
Hs.172634

Hs.32964

Hs.50895
Hs.108106 
Hs.25960 
Hs.41295 
Hs.61262 
Hs.293772 
Hs.191349 
Hs.88095 
Hs.271004 
Hs.117183 
Hs.79378 

Hs.23616 

Hs.79440

Hs.2265 

Hs.2839 

Hs.171077

Hs.85195

Hs.285574

Hs.152677

Hs.130853

Hs.61635

Hs.226785

Hs.50831

Hs.265398 

Hs.9700 

Hs.130425 
Hs.75379 
Hsi79727 
Hs.31141 
Hs.27607 
Hs.49136 
Hs.2142 
Hs.146228 
Hs.169840
Hs.9012 
Hs.179912 

Hs.152618 
Hs.24286 
Hs.214410 



ESTs; HER4 (c-erbB4)

INTERFERON-GAMMA INDUCED PROTEIN PRECURS 
SRY (sex determining region Y)-box 11
EST - not in UniGene



IGF-II mRNA-binding protein 3

secretory granule, neuroendocrine protein 1 (7B2 p

Norrie disease (pseudoglioma)

ESTs, Weakly similar to similar to serine/threonin

ESTs, Highly similar to t(3;5)(q25.1;p34) fusion g



solute carrier family 1 (glial high affinity gluta



5-hydroxytryptamine (serotonin) receptor 3A
ESTs 

TTK protein kinase 



S.1 19383 ESTS 



Hs.86368 
Hs.102408 
Hs.33846 



Hs.128612 ESTs 



Homo sapiens cDNA FLJ14303 fis, clone PLACE2000132



G protein-coupled receptor 39

neuronal pentraxin II

zinc finger protein

WAS protein family, member 1

dermatan sulphate proteoglycan 3

ESTs 



Hs.72402 
Hs.22972 
Hs.25040 



gb:Homo sapiens cig33 mRNA, partial sequence.



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419088 AI638323 

453922 AF053306 

428253 AL133640 

426471 M22440 

407881 AW072003 

452291 AF015592 

445537 AJ245671 

442875 BE623003 



Hs.77496 
Hs.36708 
Hs.183357 
Hs.170009 



Hs.137206 
Hs.73625 

Hs.38018 



budding uninhibited by benzimidazoles 1 (yeast hom

Homo sapiens mRNA; cDNA DKFZp586C1021 (from clone

transforming growth factor, alpha

heparan sulfate (glucosamine) 3-O-sulfotransferase

CDC7 (cell division cycle 7, S. cerevisiae, homolo

EGF-like-domain; multiple 6

Homo sapiens clone TCCCTA00142 mRNA sequence

Homo sapiens mRNA; cDNA DKFZp564H1663 (from clon



429782 NM_005764 

424945 AI221919 

414972 BE263782 

439262 AA832333 

403381 #(NOCAT) 

424B34 AK001432 

435509 AI458679 

445413 AA151342 

414083 AL121282 

421373 AA608229 

430510 AW162916 

446999 AA151520



dual-specificity tyrosine-(Y)-phosphorylation regu 
KIAA0307 gene product 

Ras-GTPase-activating protein SH3-domain-binding p 
Ms.lC343a hypothetical protein FU1C582 
Hs.77695 KIAA0008 gene product 
Hs.124399 ESTs
0 

Hs.153408 Homo sapiens cDNA FLJ10670 fis, clone NT2RP200311i



431548 AI834273 

433764 AW753676 

434636 AAa83764 

461807 W52854 

437872 AK002015 

443054 AI745185 

420092 AA814043 

420159 AI572490 

447164 AF026941 

451254 AI571016 

432677 NM_004482 

460434 AA166950

400301 X03635 

408829 NM_006042 

434891 AA814309 



445657 AW612141 
450221 AA328102 
426320 W47695 



Hs.12677
Hs.257786
Hs.167771



Hs.184572
Hs.48915 
Hs.9711 



ESTs 
ESTs 

hypothetical protein PR02577 

hypothetical protein PRO2605 

gb:zk15e04.s1 Soares_pregnant_uterus_NbHPU Homo sa

Prostate cancer associated protein 1 

pregnancy specific beta-1-glycoprotein 9 

cell division cycle 2, G1 to S and G2 to M

serine/threonine kinase 15

Homo sapiens cDNA FU13018 Us. done Nr2RP3000B85 



Hs.88045 
Hs.99785 
Hs.17518 
Hs.172967 



447342 AI199268 

434424 AI811202 

438078 AI016377 

437212 AI765021 

417728 AW138437 

438081 H49546 

411571 AA122393 

435663 AI023707 

424717 H03754 

425734 AF056209

450505 NM_004572

436211 AK001581 

436396 AI683487 

425695 NM_005401

438180 AA808189 

447268 AI370413



Hs.184987 
Hs.206332 
Hs.19322 
Hs.125366



Hs.24790 
Hs.298964 
Hs.70811 
Hs.134273 
Hs.152213
Hs.159396 



Hs.299112 
Hs.159238
Hs.272151 

Hs.36563 
Hs.150587 

Hs.163716 
Hs.22588 
Hs.23282 
Hs.30057 
Hs.36232 
Hs.99915 



453102 NM_007197 Hs.31664

424001 W67883 Hs.137476

434415 BE177494 

417576 AA339449 Hs.82285 

438966 AW979074 

415245 N59650 Hs.27262 

422352 AA766296 HS.9920C 

425492 AL021918 Hs.158174 



Hs.123533 ESTs 



Homo sapiens cig5 mRNA, partial sequence 

UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-a
ESTs, Weakly similar to partial CDS [C.elegans]

Estrogen receptor 1 
heparan sul'-'- - 



gb:Ui.H-BW0-^p-g-09*Ul.s1 Na_CGAP_Sub6 Homo s 
ESTs 

ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMI
ESTs; Weakly similar to !!!! ALU SUBFAMILY J WARNI
Homo sapiens cDNA: FLJ23523 fis, clone LNG05548



hypothetical protein FLJ20516
ESTs 

wingless-type MMTV integration site family, m 



polymerase (DNA directed), gamma 

Homo sapiens cDNA FLJ11441 fis, clone HEMBA1001323

protein tyrosine phosphatase, non-receptor type 14



Homo sapiens mRNA for Hmob33 protein, 3' untransla

ESTs 
ESTs 

transporter similar to yeast MRS2
KIAA0186 gene product 

androgen receptor (dihydrotestosterone receptor; t
Homo sapiens mRNA full length insert cDNA clone EU
frizzled (Drosophila) homolog 10 
KIAA1051 protein 

gb:RC6-HT0596-270300-011-C06 HT0596 Homo sapiens c
phosphoribosylglycinamide formyltransferase, phosp
gb:EST391184 MAGE resequences, MAGP Homo sapiens c



zinc finger protein 184 (Kruppel-like)



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414142 AW358397 

412170 D16632 

410011 AB020641 

436476 AA326108 

414132 AI801236

437789 AI581344 

460192 AA263143 

449328 AI962493

440238 AW451970 

403667 #(NOCAT)

408826 AF216077 

418735 N48769 

413627 BE182082 

446293 AI420213 

441627 AA947552 

425465 L18964 

409242 AL080170 

450262 AW409872

440250 AA876179 



Hs.150042 
Hs.73729 
Hs.57856 
Hs.53631 
Hs.48480 
Hs.127812 
Hs.24596 
Hs.197647 
Hs.155644

Hs.48376 
Hs.44609 
Hs.246973 
Hs.149722 



Homo sapiens clone HB-2 mRNA sequence 



protein kinase C; iota
DKFZP434C091 protein 

ESTs, Moderately similar to ALU7_HUMAN ALU SUBFA



436032 AA150797 

407771 AL138272 

436039 AW043921 

444342 NM_014398 

407829 AA045084 

409731 AA126986 

404253 mom] 

424120 T80579 

429126 AW172356 

413573 AI733859

421464 AA291553 



420362 U79734 

444743 AA045648 

415138 C18356 

410668 AW162948 

429418 AI381023 

409178 BE393948 

446608 N75217 

425905 AB032959 

428532 AF157326 

433426 H69125 

431322 AW970622 

437960 AI669536 

423244 AL039379 

424065 NM_002914 

448674 W31173 

438122 AI620270 

440048 AA897461 

418478 U38945

407162 N63855

410804 U64820 

424639 AI917494 

432415 T16971 

421470 IQ7496 

445459 AI478629

418203 X54942 

432809 AA565509

409234 AI879419 

438394 BE379623 

452097 AB002364 

453745 AA952989 

414136 AA812434 

423248 AA380177 

454018 AW016892 

452281 T93500 

424520 AA101043 

452594 AU076405

434149 Z43329 

425776 U25128 

413577 S83308 



448705 AW291095 

429163 AA884765 

413532 AW296647 

419917 AA320068 

424153 AA461737 

434255 AA846811 

435082 AA664273 

441081 AI584019 

443539 AI076182



Hs.62713
Hs.130526 
H3.1C887 



Hs.290270 

Hs.ggo83 

Hs.149089 
Hs.190086 
Hs.240770 



Hs.257846 
Hs.161700 
Hs.184786 
Hs.133625



Hs.27916 
Hs.63908 
Hs.178227 
Hs.125845 
Hs.241652 
Hs.28792 
Hs.151254 
Hs.29981 
Hs.19574
Hs.159499 

Hs.54668



Hs.130554
Hs.186104 
Hs.159006 
Hs.134074 



nuclear cap binding protein subunit 2, 20kD
ESTs; Weakly similar to Gag-Pol
huntingtin interacting protein 1
nudix (nucleoside diphosphate linked
tissue factor pathway inhibitor 2 TFPi;
pre-mRNA cleavage factor Im (68kD)



gb:EST382704 MAGE resequences, MAGK Homo sapiens 



Hs.222194 ESTs 



Hs.154140 ESTs 

Hs.129837 ESTs 

Hs.158469 ESTs, Weakly similar to envelope protein [H.sapien

Hs.1174 cyclin-dependent kinase inhibitor 2A (melanoma, p1

Hs.142634 zinc finger protein

Hs.66521 Machado-Joseph disease (spinocerebellar ataxia 3,

Hs.131329 ESTs 

Hs.289014 ESTs 

Hs.1378 annexin A3
Hs.158465 
Hs.83758 
Hs.131703 
Hs.27206 



CDC28 protein kinase 2 



CGI-124 protein 
ADAM-TS3; adis 
Homo sapiens HSPCSf 6 mRNA, partial cds 



ESTs 
ESTs 

kallikrein 7 (chymotryptic; stratum corneum)
solute carrier family 26 (sulfate transporter), me
ESTs, Weakly similar to katanin p80 subunit [H.sap
parathyroid hormone receptor 2 
SRY (sex determining region Y)-box 5 
troponin I, cardiac 

ESTs, Weakly similar to unnamed protein product [H
class II cytokine receptor ZCYT0R7 
gb:am20a10.s1 Soares_NFL_T_GBC SI Homo sapiens cDN 
Homo sapiens cDNA: FLJ21971 fis, clone HEP05790
Homo sapiens mRNA cDNA DKFZp434E232 (from clone 
MAGE-like 2 

Homo sapiens cDNA: FLJ23089 fis, clone LNG07061
Homo sapiens cDNA RJ^3m fis, done TWRO1000187 
ESTs, Moderately similar to plakophilin 2b [H.sapi



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443830 AI142095 

462606 N45202 

418384 AW149266 

425371 D49441 

429441 AJ224172 

449048 Z45051 

437117 AL049256 

449579 AW207260 

453370 AI470523 

426514 BE616633 

415076 NM_000857

408155 AB014528 

452904 AL157581

439138 AI742605 

457030 AI301740 

436281 AW411194 

407385 AA610150 

406815 AA833930 

430437 AI768801 

428743 AL080060 

415139 AW975942 

417404 NM_007350 

433527 AW235613 

449448 D60730 

457733 AW974812 

457979 AA776655 

422867 L32137 

423554 M90516 

421502 AF111B56 

412733 AA984472 

422095 AI868872 

449347 AV649748

440870 AI687284 

437478 AL390172 

411598 BE336654 

418134 AA397769 

413845 AA85298S 

452039 AI922988

410555 U92649 

412719 AW016610 

410566 AA373210 

437099 N77793 

453431 AF094754 

408920 AL120071 

417866 AW067903 

420440 NM_0024C7 

430291 AV660345 

405547 #(NOCAT) 

427510 Z47542 

435793 AB037734 

427975 AI536065 

428949 AA442153

452693 T79153 

440138 AB033023 

421246 AW582962

445424 AB028945

448186 AA262105 

425154 NM_001851



452971 AI873878 

428927 AA441837 

425282 AW163518 

419247 S65791 

446640 AW969626 

422938 NM_001809 

447078 AW885727 

421247 BE391727 

407896 D76435 

436666 AI364997

417830 AW504786 

429826 N93266 

432030 AI903400 

443270 NM_004272 

453900 AW003582 

411096 U80034 

419558 AW953679 

427386 AW836261 

427961 AW293165 

404561 #(NOCAT)

429682 NM_006306

407216 N91773 



Hs.143273
Hs.90012 
Hs.26130 
Hs.166981 
Hs.204096 
Hs.22920 
HS.122SS3 
Hs.134014 
Hs.182356 
Hs.301122 
Hs.77890 
Hs.43133 
Hs.30967
Hs.193696
Hs.173381
Hs.120061 
Hs.272072 



Hs.301549 
Hs.82101 



Hs.74554 
Hs.288966 
HS.29S901 
Hs.150539
Hs.118811 
Hs.70937
Hs.86617 
Hs.89232 
Hs.172610 
Hs.64311 
Hs.129911 
Hs.43047 
Hs.48659 
Hs.32973 



Hs.48589 

Hs.6982 

Hs.300961 

Hs.12696 

Hs.4094 

Hs.154860

Hs.284137 



Homo sapiens cDNA: FLJ23441 fis, clone HSI00612



, Moderately similar to translation initiation
bone morphogenetic protein 7 (osteogenic protein 1
guanylate cyclase 1, soluble, beta 3
KIAA0628 gene product

Homo sapiens mRNA; cCWA DKFZp434H)626 (from clone 

ESTs 



ESTs, Moderately similar to ALU7_HUMAN ALU SUBFA



pleckstrin homology-like domain, family A, member



cartilage oligomeric r 
glutamine-fructose " 
solute carrier famil
KIAA0080 protein
• • • (fen 



se) 



Homo sapiens cDNA FLJ13793 fis, clone THYRO1000085
ESTs 

H3 histone family, member K
ESTs 

chromobox homolog 5 (Drosophila HP1 alpha)
ESTs 

a disintegrin and metalloproteinase domain 17 (tum
ESTs 

Homo sapiens cDNA FLJ13586 fis, clone PLACE1009160
ESTs, Highly similar to LMA1_HUMAN LAMININ ALPH
glycine receptor, beta 

fibronectin leucine rich transmembrane protein 2
collagen, type XI, alpha 1

CGI-49 protein
0 

small nuclear RNA activating complex, polypeptide 

ESTs 
ESTs 

ESTs, Weakly similar to AF208866 1 BHW)13 [H.sapie 

zinc finger protein 228 

hypothetical protein FLJ10201

ESTs, Highly similar to AF151805 1 CGI-47 protein

cortactin SH3 domain-binding protein

Homo sapiens cDNA FLJ14208 fis, clone NT2RP3003264

collagen, type IX, alpha 1

Homo sapiens cDNA FLJ12888 fis, clone NT2RP2004081
gb:EST388262 MAGE resequences, MAGN Homo sapiens
Homo sapiens cDNA FLJ20843 fis, clone ADKA01954
ESTs 
ESTs 



Hs.155485
Hs.89764 
Hs.31704 



Hs.7672 

Hs.132308

Hs.40747 

Hs.143739 

Hs.9192 

Hs.226414 

Hs.68583 

Hs.278394 

Hs.177486 

Hs.143134 



huntingtin interacting protein 2 



epithelial cell transforming sequence 2 oncogene

ESTs 

ESTs 

Homer, neuronal immediate early gene, 1B

ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMIL 

mitochondrial intermediate peptidase

ESTs 

amyloid beta (A4) precursor protein (protease next 
ESTs 
0 

SMC1 (structural maintenance of < 






410658 AW105231 

413930 M861S3 

414315 Z24878 

427878 C05766 

431041 M490967 

441645 AI222279 

428071 AF212848 

436406 AW105723

429181 AW979104 

410909 AW893161 

424345 AK001380 

461996 AW514021 

449318 AW236021 



442611 BE077155 

452401 NM_007116 

453161 AA628608 

419948 AB041035

427718 AI798630 

453867 AI929383 

422634 NM_016010 

444476 W07318 

428002 AA418703 

443486 NM_003428 

451177 AI96971S 

408298 AI745325

435867 AA954229

423698 AA329796 

448543 AW897741 

427660 AI741320

430345 AK000282 

433222 AW614472 

449532 W74653 

452822 X85689 

437641 AA811452

418379 AA218940 

416530 U62801 

433589 AA886530 

409143 AW025980 

410303 AA324597 

413384 NM_000401 

424698 AA164366

431229 AA496479 

433377 AI752713 

445236 AK001676 

406367 #(NOCAT) 

442500 AI819068 



417791 AW965339

433277 W27266

447835 AW591623

434401 AI864131 

437496 AA452378 

418849 AW474547 

428093 AW594606 

408621 AI970672 

453096 AW294631 

418852 BE637037 

436787 AA908554 

446577 AB040933 

437267 AW511443 



RAB11A, member RAS oncogene family
gb:HSB65D052 STRATAGENE Human skeletal muscle cD 
CGI-07 protein 



Hs.182339 transcription factor ESE-3B 



.294009 ESTs 



125346 
294005 
Hs.53112 

Hs.245510
Hs.108788 
Hs.42746 
Hs.38489 



HS.94S0 

Hs.13034 

Hs.271923 

Hs.114052

Hs.1098 

Hs.21380 

Hs.114121 

Hs.239681 

Hs.238415 



ESTs, Weakly similar to zeste [D.melanogaster]



1S.15830 Homo sapiens cDNA FU12691 Ss, clone Nr2RM4002S71 
jsis factor, alpha-induced protein 6 



M-phase phosphoprotein 1

gb3v98c03.;1 SaaresJ4liHMPu_S1 Homo sapiens cDNAc 
zinc finger protein 84 (HPF2)



ESTs; 



ill! ALU SUBFAMILY SB2 



DKFZp434J1813 protein

Homo sapiens mRNA; cDNA DKFZp586P1124 (from clone
Homo sapiens cDNA: FLJ23228 fis, clone CAE06654
hypothetical protein FLJ20275
ESTs, Moderately similar to ALUSLHUMAN ALU SUBFA 



Hs.29igi1 ESTs 
Hs.1 37516 



no sapiens cDNA: FU22621 fls, clone HS105658 



Hs.188812 
Hs.138965 
Hs.21851 
Hs.75334 
Hs,151973 

Hs.43846 
Hs.12457 

Hs.209122 
Hs.24385 
Hs.215725 
Hs.182364 
Hs.1 608 
Hs.1 73392 

Hs.194097 
Hs.1 11471 
Hs.151010 
Hs.164129 
Hs.71119 
Hs.170144 
HS.53S6S 
Hs.104830 
Hs.46638 



no sapiens cONA FLJ 1 2900 lis. clone MT2RP2004321 



hypolhelical protein FU10814 



Human hbc647 mRNA sequence 
ESTs 

ESTs, WeaMy similar to 25 kDa trypsin inhibitor [ 
replication protein A3 (14kD) 
KIAA114S protein 

gb:EST391282 MAGE resequences, MAGP Homo sapiens c 



Putative prostate cancer tumor suppressor 
Homo sapiens mRNA; cDNA DKFZp547J125 (from clone D
ESTs, Weakly similar to B0491.1 [C.elegans]
ESTs 

1 open reading firame 8; fetal brain ( 



5.273294 hypothetical protein FLJ20069 

S.192756 ESTs 

5.15420 KIAA1 500 protein 



447020 T27308 

453878 AW964440 

410824 AW994813 

427701 AA411101 

424602 AK002055 

430044 AA464510 

417423 AA197341 

421477 AI904743 

433384 AI021992 

434160 BESS1196 

443555 N71710 

416198 H27332 



Hs.37921 
Hs.1 6986 
Hs,19025 
Hs.33264 
Hs.221750 
Hs.301129 
Hs.152812 
Hs.111164 
Hs.104650 
Hs.124244 
Hs.1 14275 
Hs.21398 



hypothetical protein FLJ11046 



hypothetksal protein FU102S2 

ESTs 

ESTs 

ESTs, Moderately similar to GNPI_HUMAN GLUCOSAM
ESTs 




424539 L02911 

436645 AW023424 

417251 AW015242 

447207 AA442233 



435532 AW291488 

443268 AI800271 

446140 AA356170 

4S2891 N7SS82 

431130 NM_006ia3 

408938 M059013 

432842 AW674093 

436754 AI061288 

442573 H93366 

409049 AI423132 



429414 AI7836S6 

418882 NM_004996 

422505 AL120852 

426977 R15138 

428555 NM 002214 

452909 NM 015368 

449535 W15267 

452232 AW020603 

409732 NM_016122 

416115 AA214226 

423161 AL049227 

441085 AW136551 

423575 C18863 

415211 R6473acomp 

418804 AA809632 



421308 AA687322 

456273 AF154846 

443933 AI091631 

434651 BE387162 

440351 AF030933 

426300 U15979 

453775 NM 002916 

446102 AW168067 

420547 AF155140 

429486 AF156827 

429944 R13949 

433042 AW193534 

434988 AI413065 • 

452571 W31518 

434361 AF1 29765 

406400 #(NOCAT) 

410227 AB009284 

419945 AW290975 



431349 AAS03653 

446254 BE179829 

447505 AL049266 

448027 AI458437 



activin A receptor, type I



37 kDa leucine-rich repeat (LRR) protein



hypothetical protein FLJ12496

Homo sapiens cDNA: FLJ21908 fis, clone HEP03830

ESTs, Weakly similar to KIAA0357 [H.sapiens]

epididymis-specific; whey-acidic protein type; fou

ESTs

hypothetical protein PRO2605

ESTs, Moderately similar to gonadotropin inducible

Branched chain aminotransferase 1, cytosolic. U215

ESTs

Meis (mouse) homolog 3
ESTs

ESTs, Weakly similar to WASP-family protein [H.sap
UDP-Gal:betaGlcNAc beta 1,3-galactosyltransferase,
ESTs, Highly similar to VAB1_HUMAN VACUOLAR AT
empty spiracles (Drosophila) homolog 2
ATP-binding cassette, sub-family C (CFTR/MRP), mem
ESTs; (HSA)PAP protein (programmed cell death 9;
Homo sapiens clone 25052 mRNA sequence
integrin, beta 8
pannexin 1

low density lipoprotein receptor-related protein 6
ESTs

NY-REN-58 antigen
hypothetical protein

Homo sapiens mRNA; cDNA DKFZp564N1116 (from clon
Homo sapiens cDNA FLJ12532 fis, clone NT2RM4000200
ESTs

ESTs; Highly similar to SPERM SURFACE PROTEIN SP1
gb:nz17h04.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clo
cholinergic receptor, nicotinic, alpha polypeptide



zinc finger protein

Homo sapiens two pore potassium channel KT3.3

ESTs, Highly similar to XPB_HUMAN DNA-REPAIR PRO

RAD1 (S. pombe) homolog

delta-like homolog (Drosophila)

replication factor C (activator 1) 4 (37kD)

ESTs

gonadotropin-regulated testicular RNA helicase

hypothetical protein FLJ10339

Homo sapiens clone 24881 mRNA sequence

Homo sapiens cDNA FLJ11660 fis, clone HEMBA1004610



gb:EST380338 MAGE resequences, MAGJ Homo sapiens c
ESTs, Moderately similar to ALU2_HUMAN ALU SUBFA
Homo sapiens cDNA FLJ12832 fis, clone NT2RP2003137
Homo sapiens mRNA; cDNA DKFZp564F093 (from clone



424078 AB006625 

435244 N77221 

404996 mXATj 

407905 AW103655 

411560 AW851186 

424341 AA385074 

441675 AI914329 

452172 H00797 

420276 AA290938 

402820 mOCAT) 

419699 AA24B998 

422529 AW015128 

438018 AK001160 

441826 AW503603 

453931 AL121278 



paternally expressed gene 3



Hs.252905 ESTs



Hs.31246 

Hs.256703 

Hs.5999 

Hs.129915 

Hs.25144 






43SS38 AB0115<I0 

457465 AW301344 

418848 AI820961 

408321 AW405882 

447499 AW262580 

424513 BE385864 

432731 R31178 

448275 BE514434 

430371 D87466 

448693 AW004854 

407289 AA135159 

448141 AI471598 

434699 AA643687 

417718 T86540 

436464 AI016176 

427528 AU077143 

409092 AI735283 

416241 N52639 

432005 AA524190 



Hs.4930 

Hs.195969 

H5.193465 

Hs,44205 

Hs.147674 

Hs.149894 

Hs.287820 

Hs.20830 

Hs.240112 

Hs.228320 

Hs.203349 

Hs.149425 

Hs!269783 
Hs.179665 
Hs.172608 
Hs.32683 
Hs.120777 



low density lipoprotein receptor-related protein 4



448743 AB032962 Hs.21896 

451389 N73222 Hs.2173a 

453331 AI240655 Hs.8895 

454036 AA374756 Hs.93560 

448133 AA723157 Hs.73769 

429697 NM.003815 Hs.2442 



409459 D86407 
431708 AI698136 
433906 AI167816 



441423 AI793Z99, 

429876 AB028977 

446770 AV660309 

412078 X69699 

422093 AF151852 

423123 NH/L012247 

448390 AL035414 

453628 AW243307 

449722 BE280074 

436679 AI127483 

431592 R69016 

432383 AK000144 

419926 AW900992 

452367 U71207 

401644 #(NOCAT) 

410044 BE566742 

413775 AW409934 

424296 AI631874 

431118 BE254901 

432201 AI538613 

451073 AI768905 

451592 AI80S416 

4S24S3 AI902S19 

441020 W79283 

439024 R96696 

453619 H87648 

453459 BE047032 

408427 AW19427C 

419311 AA689591 

426460 D79721 

444540 AI693927 

452943 BE247449 

453913 AW004683 

417847 AI521558 

428866 AA436735 

423679 AA431765 

436209 AW850417 

446936 H10207 

406076 AL390179 

428319 AL135623 

406671 AA129547 

418432 M14166 

417048 AI088775 

431750 AA614986 

439314 AA382413 



455700 BE068115 

409073 AA063458 

433929 AI37S499 

415457 AW081710 



Hs.120451 
Hs.293871 
Hs.274449 
Hs,93796 
Hs.29279 



Hs.7627 

Hs.254020 

Hs.47314 

Hs.137011 

Hs.193914 

Hs.285754 

Hs.85112 

Hs.55498 

Hs.283705 

Hs.178144 

Hs.94812 



synaptic Ras GTPase activating protein 1 (homolog
KIAA0276 protein

Homo sapiens cDNA: FLJ23537 fis, clone LN607690
Homo sapiens cDNA FLJ12149 fis, clone MAMMA100042

ESTs

Homo sapiens cDNA FLJ11980 fis, clone HEMBB1001304

ESTs

ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMIL



ESTs, Weakly similar to ELL2_HUMAN RNA POLYMER



ESTs, Weakly similar to unnamed protein product [H
folate receptor 1 (adult)

. ^ j^^g,^ g (^gl,



low density lipoprotein receptor-related protein 8



ESTs

KIAA1054 protein

ESTs, Weakly similar to AF137386

paired box gene 8

CGI-94 protein

SELENOPHOSPHATE SYNTHETASE ; I
hypothetical protein



ESTs, Weakly similar to unnamed protein product [H
ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMIL
Homo sapiens cDNA FLJ20137 fis, clone COL07137
DKFZP586D2223 protein
eyes absent (Drosophila) homolog 2



enrich in leucine heptad 



Hs.250502 
Hs.135657 
Hs.206063 
Hs.213897 



gtKQV-BTOOS-l 01 198-051 BT009 Homo sapiens cI»IA, m 



H.sapiens novel gene from PAC 117P20, chromosome 1



hypothetical protein FLJ10525

ESTs



Homo sapiens cDNA FLJ22316 fis, clone HRC05262
Homo sapiens cDNA FLJ22002 fis, clone HEP06638
gb:zw80c03.s1 Soares_testis_NHT Homo sapiens cDNA
CGI-60 protein

ESTs, Moderately similar to unnamed protein produc
ESTs

Homo sapiens mRNA; cDNA DKFZp547P134 (from clone
KIAA0575 gene product

met proto-oncogene (hepatocyte growth factor recep
insulin-like growth factor 1 (somatomedin C)
geranylgeranyl diphosphate synthase 1
ESTs 



Hs.27379 
Hs.7369 
Hs.283713 



ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMIL 






451024 AA442176 

415539 AI733881 

421515 Y11339 

420736 AI263022 

453293 AA382267 

409564 AA045857 

418378 AW962081 

429528 H09604 

439635 AA477288 

440452 A1925136 



452795 AW392555 

443171 BE281128 

425322 U63630 

442717 R88362 

414747 U30872 

417300 AI765227 

417389 BE260964 

448105 AW591433 



420908 AIJM9974 

421039 NM_003478 

426890 AA393167 

428671 NM_006531 



gb:zw63b08.r1 Soares_total_fetus_Nb2HF8_9w Homo sa
BMPR-Ib; bone morphogenetic protein receptor typ
GalNAc alpha-2,6-sialyltransferase I, long form



Hs.54943 

Hs.13268 
Hs.94891 
Hs.55150 
Hs.112759 
HS.221S1 
Hs.18878 
Hs.9030 
Hs.155637 
Hs.180591 
Hs.77204 
Hs^eiO 
Hs^2045 
Hs.170675 
Hs.301622 

Hs.183114 
Hs.93201 
Hs.100261 
Hs.101299 
Hs.41294 
Hs.2291 
H3.10 



Hs.226440 
Hs.75319 



Hs.12407 
Hs.172636 
Hs.21672 



424081 NM_006413 

418376 NM.003031 

447204 A13SS881 

407910 AA650274 

412314 AA826247 

436291 BE568452 

450664 AJ246687 

426991 AK001536 

409365 AA702376 

410784 AW803201 

413374 NM.001034 

413426 F20966 

417666 AA780791 

424783 AA913909 

425024 R39235 

446941 A1267371 

448596 AB014544 

453448 AL036710 

468944 N93227 
400284 

441134 W29092 

408796 AA638292 

408296 AU 17452 

438913 A1380429 
402408 

411630 U42349 

450701 H33960 

439780 AL109688 

418301 AW976201 

420077 AW512260 

426572 AB037783 
403721 

411945 AL033527 
408684 R61377 
414869 AA157291 

437980 R60393 
451050 AW937420 



TABLE 14B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



ESTs

Homo sapiens cDNA: FLJ22729 fis, clone HSI15685
ESTs, Weakly similar to CAYP_HUMAN CALCYPHOSIN
ESTs, Weakly similar to AF126780 1 retinal short-c
KIAA1226 protein
hypothetical protein FLJ21620
TONDU

protein kinase; DNA-activated; catalytic polypepti

ESTs, Weakly similar to R06F6^ [C.elegans]

centromere protein F (350/400kD, mitosin)

solute carrier family 30 (zinc transporter), membe

Midkine (neurite growth-promoting factor 2)

ESTs, Weakly similar to TMS2_HUMAN TRANSMEMBR

ESTs



Homo sapiens mRNA; cDNA DKFZp564B222 (from clone
ESTs

Probe hTg737 (polycystic kidney disease, autosomal



Hs.193143 KIAA1069 protein



Homo sapiens cDNA FLJ10784 fis, clone NT2RP4000448

ribonuclease P (30kD)

synaptosomal-associated protein, 25kD

ESTs, Moderately similar to ALUC_HUMAN !!!! ALU a

fibronectin leucine rich transmembrane protein 3

heat shock factor binding protein 1

ESTs; Highly similar to protein regulating cytokin

Kruppel-type zinc finger protein

Homo sapiens cDNA FLJ12852 fis, clone NT2RP2003445

Homo sapiens clone 24881 mRNA sequence

gb:IL2-UM0077-070500^0-E06 UM0077 Homo sapiens c



gb:HSPD05390 HM3 Homo sapiens cDNA clone 032-X4-1
ESTs, Weakly similar to KIAA0973 protein [H.sapien
TATA box binding protein (TBP)-associated factor,



DKFZP586G1517pr(



Homo sapiens cDNA FLJ12280 fis, clone WMM 001 74
gb:Homo sapiens mRNA full length insert cDNA clone
Hs.187618 ESTs
Hs.87767 ESTs

Hs.170623 hypothetical protein FLJ11183



Hs.12727 hypothetical protein FLJ21610
Hs.72163 ESTs
Hs.278436 KIAA1474 protein



109861_1 AA063458 AA063018 AI444822
1221005_1 AW303201 BE079700 BE062940
1249443_1 AW851186 AW996967 BE143456
136886_1 F20956 AA129374 AA133740 AW819878
143612_1 Z24878 AA494098 F13654 AA494040 AA143127
174666_1 AW962081 AA218925 AA354237
179138_1 AA809632 AI917245 AI701732 AA228406
183793_1 AA689591 AW974261 AA236240 AI077451 AA631399 AW974262

420637 195241_1 AW976153 AA278946 AA747691
424341 238294_1 AA385074 AA339054 AA339115 AW956359
428002 285602_1 AA418703 AA418711 BE071915 BE071920 BE071912
428679 294049_1 AA431765 AA432015
429163 300543_1 AA884766 AW974271 AA592975 AA447312
430153 313709_1 AW968128 AA468102 AA468165
431229 330060_1 AA496479 T89859 AW020056 AW135251 AI221100 AA628705 AI263148 T79074
431322 331543_1 AW970622 AA503009 AA502998 AA502989 AA502805 T92188
434415 385931_1 BE177494 AW276909 AA632849
436812 427323_1 AW298067 AA731645 AA810101 AW194180 AI690673 AW978773
437938 44573_2 AI950087 N70206 R97040 N36809 AI308119 AW957677 N35320 AI251473 H59397 AW971573 R97278 W01069 AW967671 AA908598
AA251875 AI820501 AI820532 W87391 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 AI219788 AA884444 N92578
F13493 AA927794 AI560251 AW874066 AL134043 AW235363 AA663345 AW008282 AA488964 AA283144 AI890387 AI950344 AI741346
AI689062 AA282915 AW102898 AI872193 AI763273 AW1735e6 AW150329 AI653B32 AI762688 AA988777 AA488892 AI356394 AW103813
AI539542 AA642789 AA855975 AW505512 AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538
AA130009 AA337499 AWg61101 AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741
AW072629 AW613996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW6f3456
AI373032 AI664269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 R04005
438966 467436_1 AW979074 AA334841 AA828650
438993 467651_1 AA828995 AA834879 AI926361
439042 468079_1 AW979172 AA829595 R96050
439780 47673_1 AL109688 R23665 R26578
442438 542469_1 AA995998 AI916584 R61781 T77332 F07756 F08149 F07647
449034 794817_1 AI624049 AW117770 AI858360
451024 85565_1 AA442176 AA259181
452453 918300_1 AI902519 AI902518 AI902516
455700 1351264_1 BE068115 BE068104 BE068102 BEC68096 BE068103 BE0681S4 BE068198
458861 798085_1 AI630223 AI630470

TABLE 14C: 

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons




175318-175476 

37999-38145,38652-38998,39727-39872,40557-40674,42351-42450
124361-124520,124914-125050 
71754-71944 



1553-17U1878-2140.4252-4385.5922-6077 
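The Nt_position entries of Table 14C can be read programmatically. The following is a minimal sketch, assuming each entry is a comma-separated list of start-end pairs and that both coordinates are inclusive (an assumption; the table does not state the convention). The function names are hypothetical and are not part of the original disclosure.

```python
def parse_exon_ranges(nt_position):
    """Split an Nt_position string into (start, end) integer pairs."""
    exons = []
    for part in nt_position.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def total_exon_length(exons):
    """Sum the exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Entry taken from Table 14C.
exons = parse_exon_ranges("124361-124520,124914-125050")
print(exons, total_exon_length(exons))
```

An entry whose digits were damaged in reproduction would raise a ValueError in this sketch rather than be silently misread.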



Table 15A lists about 499 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to be extracellular or cell-surface proteins. These were selected as for Table 14A, except that the ratio was greater than or equal to 3.0, and the predicted protein contained a structural domain that is indicative of extracellular localization (e.g., Ig, fn3, egf, 7tm domains). Predicted protein domains are noted.
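The two selection criteria described above (a ratio of at least 3.0 and a predicted domain indicative of extracellular localization) can be illustrated with a short sketch. This is not the procedure actually used to build Table 15A: the domain spellings, the record layout, and the example rows below are hypothetical and chosen only to show the filter.

```python
# Domain labels treated as markers of extracellular localization. Ig, fn3,
# egf and 7tm are named in the text; the exact spellings used here
# (for example "7tm_1") are assumptions for this sketch.
EXTRACELLULAR_DOMAINS = {"ig", "fn3", "egf", "7tm_1"}
MIN_RATIO = 3.0


def has_extracellular_domain(domains):
    """Return True if any predicted domain is in the marker set."""
    return any(d.lower() in EXTRACELLULAR_DOMAINS for d in domains)


def select_candidates(rows, min_ratio=MIN_RATIO):
    """Keep rows meeting both the ratio and the domain criterion."""
    return [
        row for row in rows
        if row["ratio"] >= min_ratio and has_extracellular_domain(row["domains"])
    ]


# Hypothetical example rows (not taken from the tables).
rows = [
    {"pkey": "A", "ratio": 9.8, "domains": ["SS", "EGF"]},
    {"pkey": "B", "ratio": 2.5, "domains": ["EGF"]},
    {"pkey": "C", "ratio": 12.0, "domains": ["pkinase"]},
    {"pkey": "D", "ratio": 3.0, "domains": ["7tm_1"]},
]
selected = [row["pkey"] for row in select_candidates(rows)]
print(selected)
```

Row B fails the ratio threshold and row C lacks a marker domain, so only A and D are kept under these assumptions.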

TABLE 15A: ABOUT 499 UP-REGULATED GENES ENCODING EXTRACELLULAR/CELL SURFACE PROTEINS, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES

Pkey: Primekey
UG ID: UniGene ID
Title: UniGene title
Prot. Dom.: Predicted protein structural domains















Pkey | Ex.Accn | UG ID | Title | Prot.Dom. | ratio
415989 | AI267700 |  | ESTs | TM | 42.7
428579 | NM_005756 | Hs.184942 | G protein-coupled receptor 64 | TM | 30.5
428153 | AW513143 | Hs.98367 | similar to SRY-box containing gene 17 |  | 30.1
436982 | AB018305 | Hs.5378 | spondin 1, (f-spondin) extracellular matrix |  | 29.4
427585 | D31152 | Hs.179729 | collagen; type X; alpha 1 (Schmid metaphy | C1q,Collagen | 27.0
430691 | C14187 | Hs.103638 | ESTs | TM | 26.2
418007 | M13509 | Hs.83169 | Matrix metalloprotease 1 (interstitial collag | SS,Peptidase_M10 | 20.6
400292 | AA250737 | Hs.72472 | BMPR-Ib; bone morphogenetic protein rec |  | 20.6
424086 | AI361010 | Hs.102267 | lysyl oxidase | L^.O]ddase | 17.7
424906 | NM_002497 | Hs.153704 | NIMA (never in mitosis gene a)-related kin | pklse.pkinase | 17.4
427356 | AW023482 | Hs.97849 |  | TM | 17.4
407638 | AJ404672 | Hs.288693 | EST | TM | 17.1
427469 | AA403084 | Hs.269347 | ESTs | TM | 17.0
438993 | AA828995 |  | Integrin; beta 8 | SS,integrin_B | 16.7
421155 |  | Hs.102267 |  |  | 16.1
431989 | AW972870 | Hs.291069 |  | SS | 15.9
428976 | AL037824 | Hs.194695 | ras homolog gene family, member 1 |  | 15.1
416209 | AA236776 | Hs.79078 | MAD2 (mitotic arrest deficient, yeast, hom | TM | 15.0






413623 | AA825721 | Hs.246973 | ESTs | TM | 
447350 | AI375572 | Hs.172634 | ESTs; HER4(Mrbfl4) | SS,TM,Furin-like,pkinase | 14.2
428227 | AA321649 | Hs.2248 | INTERFERON-GAMMA INDUCED PRO | IL8 | 
452461 | N78223 | Hs.108106 | transcription factor | G9a,PHD | 13.7
461106 | BE332701 | Hs.25960 |  | Myc_N_term | 13.6
416208 | AW291168 | Hs.41295 | ESTs |  | 13.5
462249 | BE394412 | Hs.61252 | ESTs |  | 13.4
416666 | NM_003914 | Hs.79378 |  |  | 12.8
416661 | AA634543 | Hs.79440 | IGF-II mRNA-binding protein 3 | TM | 12.6
431725 | X65724 | Hs.2839 | Norrie disease (pseudoglioma) | SS,Cys_knot | 12.3
458027 | L49064 | Hs.86196 | ESTs, Highly similar to t(3;5)(q25.1;p34) f | TM | 12.2
408460 | AA064726 | Hs.286674 | ESTs | TM | 112
415263 | AA948033 | Hs.130853 | ESTs | histone | 11.9
400298 | AA032279 | Hs.61636 | STEAP1 | TM | 11.8
421451 | AA291377 | Hs.60831 | ESTs | TM | 11.6
443715 | AI583187 | Hs.9700 | cyclin E1 |  | 11.5
413472 | BE242870 | Hs.75379 | solute carrier family 1 (glial high affinity gl | TM,SDF | WJS
410102 | AW248508 | Hs.279727 | ESTs; |  | 
408662 |  | Hs.31141 | Homo sapiens mRNA for KIAA1568 prote |  | 11.3
 | BE379594 | Hs.49136 |  |  | 
427344 | NM_000869 | Hs.2142 |  | TM,neur_chan | 11.2
 | AI263307 | Hs.146228 | ESTs | histone | 11.2
412723 | AA648459 | Hs.179912 | ESTs | TM | 
400250 |  |  | 0 | HisLdeacetyl+F105 | 11.1
438167 |  | Hs.24286 | ESTs | 7tm_1 | 11.1
434539 | AW748078 | Hs.214410 | ESTs | TM | 10.9
450375 | AA009647 | Hs.8850 | a disintegrin and metalloproteinase domain | TM | 
400289 |  | Hs.2258 | Matrix Metalloproteinase 10 (Stromelysin 2 | SS,hemopexin | 10.8
446142 | AI754693 | Hs.145968 | ESTs | Cadherin_C_term | 10.7
421285 | NM_000102 | Hs.1363 | cytochrome P450, subfamily XVII (steroid | TM,P450 | 10.6
433496 | AF064254 | Hs.49765 | VERY-LONG-CHAIN ACYL-COA SYNT | SS,TM | 10.6
418506 | AA084248 | HSJ5339 | G protein-coupled receptor 39 | TM | 10.5
433447 | U29195 | Hs.3281 | neuronal pentraxin II | SS | 10.4
414245 | BE148072 | Hs.75850 | WAS protein family, member 1 |  | 10.3
426462 | U59111 | Hs.159993 | dermatan sulphate proteoglycan 3 | SS,LRRNT | 10.3
418601 | AA279490 | Hs.86368 |  |  | 10.3
415227 | AW821113 | Hs.72402 | ESTs | TM | 10.2
409269 | AA576953 | Hs.22972 | Homo sapiens cDNA FLJ13352 fis, clone 0 | TM | 10.1
426471 | M22440 | Hs.170009 | transforming growth factor, alpha | SS,EGF | 9.8
407881 | AW072003 | Hs.40968 | heparan sulfate (glucosamine) 3-O-sulfotran |  | 9.7
445537 | AJ245671 | Hs.12344 | EGF-like-domain; multiple 6 | SS,EGF | 9.7
414972 | BE263782 | Hs.77695 | KIAA0008 gene product | TM | 9.4
435509 | AI458679 | Hs.181915 | ESTs |  | 
445413 | AA151342 | Hs.12677 | CGI-147 protein | UPF0099 | 9.2
446999 | AA161620 | Hs.279525 | hypothetical protein PRO2605 | TM | 9.1
414569 | AF109298 | Hs.118253 | Prostate cancer associated protein 1 |  | 
406687 | M31126 | Hs.272620 | pregnancy specific beta-1-glycoprotein 9 | hemopexin | 9.0
408908 | BE296227 | Hs.48915 |  | pkise,TM | 9.0
451607 | W52854 | Hs.27099 | DKFZP564J0863 protein | TM | as
420159 | AI572490 | Hs.99785 | ESTs | TM | 8.8
432677 | NM_004482 | Hs.278611 | UDP-N-acetyl-alpha-D-galactosamine:poly | TM,Ricin_B_lectin | 8.7
408829 | NM_006042 | Hs.48384 | heparan sulfate (glucosamine) 3-O-sulfotran | TM | 8.7
438885 | AI886558 | Hs.184987 | ESTs | TM | 8.7
447342 | AI199268 | Hs.19322 | ESTs; Weakly similar to !!!! ALU SUBFAM | TM | 8.6
437212 | AI765021 | Hs.210775 | ESTs | UDPGT | 8.5
424717 | H03754 | Hs.152213 | wingless-type MMTV integration site fami | wnt | 8.4
450505 | NM_004572 | Hs.26051 | plakophilin 2 | TM | 8.4
436396 | AI683487 | Hs.299112 | Homo sapiens cDNA FLJ11441 fis, clone H | wnt | as
425695 | NM_005401 | Hs.169238 | protein tyrosine phosphatase, non-receptor | Y_phosphatase | 8.3
447268 | AI370413 | Hs.36663 | Homo sapiens cDNA: FLJ22418 fis, clone | Ribosomal_S8 | 8.2
400195 |  |  |  | TM | 8.1
424906 | AI566086 | Hs.153716 | Homo sapiens mRNA for Hmob33 protein. | TM | 8.1
 | AW16g287 | Hs.22588 | ESTs |  | 8.1
439759 | AU59055 | Hs.67709 | Homo sapiens mRNA full length insert cDN | TM | 8.0
453102 | NM_007197 | Hs.31664 | frizzled (Drosophila) homolog 10 | TM,Fz,Frizzled | 8.0
424001 | W67883 | Hs.137476 | KIAA1061 protein | TM | 8.0
442655 | AW027457 | Hs.30323 | ESTs | TM | 
446657 | AW612141 | Hs.279676 | ESTs | 7tm_1 | 
426320 | W47595 | Hs.169300 | transforming growth factor, beta 2 | SS,TGF-beta | 
 | □15532 | Hs.73729 | very low density lipoprotein receptor | TM,ldl_recept_b,EGF | 
436476 | AA326108 | Hs.53631 | ESTs | TM | 7.6
437789 | AI581344 | Hs.127812 | ESTs, Weakly similar to AF141326 1 RNA | TM | 7.6
450192 | AA263143 | Hs.24596 | RAD51-interacting protein | TM | 7.5
408826 | AF216077 | Hs.48376 | Homo sapiens clone HB-2 mRNA sequence |  | 
413627 | BE182082 |  | ESTs | TM | 
446293 | AI420213 | Hs.149722 | ESTs | LIM,homeobox | 
409242 | AU)80170 | Hs.51692 | DKFZP434C091 protein | TM,7tm_1 | 
450262 | AW409872 | Hs.271166 | ESTs, Moderately similar to ALU7_HUMA | TM | 
451659 | BE3797ei | Hs.14248 | ESTs, Weakly similar to ALU8_HUMAN A | TM | 




414342 


NM_014398 
AW172356 


Hs.10887 
Hs.99083 


similar to lysosome-associated membrane g 
ESTs 


TM 
7tnLl 




429126 
421464 


AA291553 


Hs.190086 


ESTs 


TM 




420362 


U79734 


Hs.97206 


huntingtinlnteracthg protein 1 


TM 








444743 
415138 
429418 
409178 






AF157326 
H69125 
W31178 
T16971 

X64942 



452097 
453746 
423248 
452281 
424620 



448706 AW291095 



422867 
421502 
412733 
422095 



453431 
417366 
430291 



Hs.99283 ESTs 
Hs.50915 
Hs.161700 
Hs.184786 
Hs.133525 
Hs.154140 
Hs.289014 
Hs.83758 
Hs.27693 
Hs.27916 



Kunltz_BPn,&gamma 



Hs.125845 
Hs.28792 
Hs.151254 
Hs.29981 



C6I-124 protein 
ADAM-TS3: " 

Homo sapiens HSPC316 itiRNA, partial cd 
rlbulose-S-phosphale-S^piineiase 
ESTs 

Nllkrein 7 (chyimtryptlc; siralum comeum 
solute carrier My 26 (suUiale transporter) 
ESTs, Weakly similar to katanin p80 subun 
parathyroid hormone receptor 2 

troponin I, cardiac 

ESTs, Weakly similar to unnamed protein p 
class II cytokine receptor ZCYT0R7 
Homo sapiens cDNA: FLJ21971 fis, clone 

MAGE-like2 

ESTs. Moderately similar to plakophiDn 2b 




426514 BE616633 
452904 AL157581 



ESTs, Moderately similar to translation Init 
bone morphogeneti; protein 7 (osteogenk; p 
Homo sapiens mRNA; cDNA DKFZp434E 



AW411194 

AW975942 

D60730 

AA776655 

L32137 

AF111856 

AA984472 

A1868872 



N77793 

AF094764 

AW067903 

AV660345 

/((NOCATJ 

AB037734 

AB033023 

NMJ01851 

AW960146 

AIB73878 

AA441837 



Hs.120051 ESTs 



Hs.89232 
Hs.64311 
HS.4B659 



Hs.6982 

Hs.154850 

Hs284137 

Hs.91789 

HS.902S0 

Hs,89764 

HSJ1704 

HS301570 

Hs.102910 



Hs.gt 



iolog5(DrosophilaHP1alp 
a aisimegnn and melallopratelnase domain 
ESTs, H^hly similarto UttlJIUMAN L 
glycine receptor, beta 
"collagen, type XI, alpha 1" 
CGI49 protein 
0 

ESTs 

tiypothelical p'otein FU10201 

collagen, type IX, alpha 1 

Homosapiens cDIi^ FU12888fis, clone N 

ESTs 

ESTs- 

fragile X mental retardation 1 

ESTs, We^y similar to KIAA0227 [H.sap 

ESTs 

general transcription factor IIH, poiypeptid 
ESTs 

Homer, neuronal immediate early gene, IB 



407216 N91773 

413930 1^85153 

414315 Z24878 

441645 AI222279 

449318 AW236021 

441433 AA933809 

445495 BE522641 

410153 BE311926 
442611 



Hs.201555 ESTs 



amyloid beta (A4) precursor protein (protea
ESTs

lysyl oxidase

RAB11A, member RAS oncogene family
gb:HSB65D052 STRATAGENE Human sk



Hs.108788 
Hs.42746 
Hs.38489 
Hs,15830 
Hs,177537 ESTs 



ESTs, Weakly similarto zesle p.melanoga 



inscDNAFU12691 fis, clone N 



NM_007115 Hs.29352 



419948 AB041035 



AI745325 
AW897741 
AW514472 
W74653 



Hs.108ig6 ' 

Hs.271923 

Hs.21380 

HS.23841S 

Hs,271593 

Hs.288617 

Hs.137516 , 



TM,ABC_niembrane 



HSPC037 protein 

ESTs; ^klde^ately sMarto III! ALU SUB 
Homo sapiens mRNA; cDNA I]KF2p586P 
ESTs, Moderately sMarto A1JU8.HUMA 
ESTs 

Homo sapiens cDNA: FU22621 fis, clone 
fidgetin-likel 




416530 U62801 

413384 NIL000401 

445236 AK001676 

406357 #(NOCAT) 

442500 AI819068 

450101 AV649989 

419140 AI982647 

417791 AW965339 

437496 AA462378 

418849 AW474547 

428093 AW594506 

408621 A1970672 

418852 BE537037 
404939 

447020 T27308 

410824 AW994813 

417423 AA197341 

421477 A1904743 

443555 N71710 

424539 LC2911 

416565 AW000960 

431130 NM_006103 

408938 AA059013 

436754 AI061288 

409049 AI423132 

458627 AWDBB642 

41BB82 NM.004996 

422505 AL120862 

428555 NM_002214 

452909 NIL01S368 

449535 W15267 






Hs.79361 
Hs.75334 
Hs.12457 

Hs.209122 
Hs.24335 
Hs.215725 
Hs.111471 
Hs.170144 
HS53565 
Hs.104630 
Hs.46638 
Hs.273294 



kallikrein 6 (neurosin, zyme)
exostoses (multiple) 2
hypothetical protein FLJ10814



hypothetical protein FLJ11046



Hs.104650 
Hs.21398 
Hs.150402 
Hs.44970 
Hs.2719 
Hs.22607 
Hs.133437 
Hs.146343 
Hs.97984 
Hs.89433 
Hs.124165 
HS.1B4908 
HSJ098S 
HS23672 
HS271698 



epididymis-specific; whey-acidic protein ty
ESTs

ESTs, Moderately similar to gonadotropin i
ESTs

ESTs; Weakly similar to WASP-family pro
ATP-binding cassette, sub-family C (CFTR
ESTs; (HSA)PAP protein (programmed ce
integrin, beta 8

low density lipoprotein receptor-related pro



TM,ABC_raembrane 



423161 


AL049227 


Hs.124776 


Homo sapiens mRNA; cDNA DKFZp564N 


Cadheiin.CJerm 




428405 
433330 


Y00762 
AW2070B4 


HS2266 
Hs.132816 


cliolinergic receptor, nicotinic, alptia polype 
ESTs 


tm' 


4.7 


443933 


AI091631 


HS.135S01 


Homo sapiens two pore potassium channel 




4.7 


440351 


AF030933 


Hs.7179 


IWD1 (&pomb8)homolog 


TM 


4.7 


426300 


U15979 


Hs.169228 


dells^ike homolog (Drosophila) 


TM,EGF 


4.7 


453775 


NHL002916 


HSJ5120 


replication factor C (activator 1) 4 (37kD) 


AAA,DEAD,helicase_C 




429944 


R13949 


H3.226440 


Homo sapiens dace 24881 mRNA sequenc 




47 


434988 


A1413055 


Hs.161160 


ESTs 


TM 


4.6 


406400 


#(NOCAT) 
AW628666 




0 


tiypsHTM 


4.6 


446254 


BE179829 




Homo sapiens cDNA FU12832 Us, done N 


TM 


4,6 


459574 


AI741122 


Hs'lOISIO 


Homo sapiens cDNA FU14232 lis, clone N 


TM 


46 


409928 


AL137163 


Hs.57549 


hypothetical protein dJ473B4 


TM 


4.6 


435241 


N77221 


Hs.187824 


ESTs 


pkinase,fn3 


4.6 


404995 


#(NOCAT) 






PepHdase_C1 


4.6 


407905 


AW103655 


Hs^2905 


ESTs 


SS,Ephrin 


4.6 




AI914329 


HS5461 


ESTs 


TM 


4.6 


420275 
422529 


AA290938 
AW015128 


Hs.190561 
Hs^703 


ESTs, Highly similar to mosaic piotein LR1 
ESTs 


TM,fh3,ldLrecepLa 
TM 


4.5 
4.5 


438018 


AK001160 


Hs.5999 


hypothetical piotein FU102g8 


TM 


4.S 


457465 


AW301344 




ESTs 


TMlpkbT" 


4.5 


418848 


AI820961 


Hs!l93465 


ESTs 


4,5 


447499 


AyV262S80 


Hs.147674 


KIAA1621 protein 


TM 


4.5 


432731 


R31178 


H5.287820 


tibroneclin 1 


SS 


4.5 


434699 


AA643687 


Hs. 149425 


Homo sapiens cDNA FLJ11980(is. ctone H 


Nucleoskle_tra2 


4.4 


409092 


AU077143 
AI735283 


HS179565 
Hs.172608 


minichramosome maintenance deficient (S. 


TM 
TM 


4.4 


451369 
453331 


N73222 
AI240665 


Hs.21738 
Hs.8895 


KIAA1008 protein 


TM 
TM 


4!4 
4.4 


448133 


AA723157 


Hs,73769 


folate receptor 1 (adul) 


TM 


4.4 


429597 
453279 


NM.003816 
AW893940 


Hs,2442 
Hs,59698 


a disinlegrin and metalloproteinase domain 


TM 
TM 


4.4 


409469 


D86407 


Hs,64481 


low density lipoprotein receptor-ielated pro 


TM,EGF,ldLreoept_a 




431708 


AI698136 


HS.1C8873 


ESTs 


TM 


4.4 


433906 




Hs,43355 


ESTs 


TM 


4.4 


441423 


A1793299 


Hs,126877 


ESTs 


TM 


4.4 




AV660309 


Hs.154986 


ESTs, Weakly simBar to AF137386 1 plasm 


TM 


4.3 


412078 


X69699 


Hs.73149 


paired boK gene 8 


TM 




423123 


NM_012247 


Hs.124027 


SELENOPHOSPHATE SYMTHETASE ; H 


AIRS 




44839C 




Hs.21068 


hypothetical protein 


TM 


43 


453628 


AW2433C7 


Hs.170187 


ESTs 


TM 


4.3 


452367 


U71207 


Hs.29279 


eyes absent (Drosophila) homobg 2 


TM 


4.3 




AW409934 


Hs.75528 


nucleolar GTPase 


MMRJJSR1 


4.3 


451692 


AI805416 


Hs.213897 


ESTs 


TM 


4.3 


419311 


AA689591 




gb;nv66a12.s1 NCLCGAP.GCBI Homos 


TM 


4.2 


452943 


BE247449 


Hs.31082 


hypotheUcai protein FU10525 


TM 


4.2 


428679 


AA431765 




gb:zw80c03.s1 SoaiGSjestis_NHT Homo s 


TM 


4.2 


436209 


AW850417 


Hs.254020 


ESTs, Moderately similar to unnamed pmte 


TM 




406076 


AL390179 


Hs.137011 


Homo saptens mRNA; cDNA DKFZp547P 


TM 


412 


426819 


AL135623 


Hs.193914 


KiAA057S gene product 


TM 


4.2 


406671 


AA129547 


Hs.285754 . 


met protoK)ncogene (hepatocyte growth fac 


F-aclin_cap_A 


4.2 






431750 AAS14S86 

449554 AA682382 

409073 AA0634S8 

433929 AI375499 

415457 AVW)81710 

444381 BE387335 

415539 AI733881 

421515 Y11339 

453293 AA382267 

409564 AA045867 

429628 H09604 

440452 A1925136 

443695 AW204099 

425322 U6363C 

417300 A1765227 

417389 BE260964 

452334 A1638627 

428771 AB028992 

412314 AA825247 

436291 BE668462 

450654 AJ245587 

409365 AA702376 

413374 NM_001034 

417655 AA780791 

445941 A1267371 

441134 W29092 

411630 U42349 

418301 AW976201 

411945 AL033527 



414869 AA157291 

420281 A1623693 

416658 1*3272 

411274 NM.002776 

437222 AL1 17588 

431958 X63629 

430634 AI860651 

415716 N59294 

420179 N7463C 

461250 AA491276 

429496 AA453e00 

421764 AI681535 

447197 R36076 

422939 AW394055 

414737 All 60386 

411773 NM.006799 

425247 NM.005940 

424433 H04607 

431846 BE019924 

407792 AI077715 

417531 NM_003157 

434836 AA651629 

439810 AL109710 

418693 AI750878 

407864 AF069291 

436304 AA339622 

452259 AA317439 

453488 W00712 

428943 AW086180 

411402 BE297855 

425176 AW015644 

400296 AA305627 

407340 AA810168 

418524 AA300576 

438279 AA805166 

439453 BE264974 

441111 AI806857 

461806 NM_003729 

409542 AA503020 

425441 AA449644 

428137 AM21792 



414343 ALJ036166 

414812 X72755 

410361 BE391804 

415786 AW419196 

427177 ABC06537 

427687 AWD03867 

444619 BE538082 

447336 AW139383 

412519 AA196241 



HS27379 

Hs.7369 

HS283713 

Hs.72472 

Hs.105352 

Hs.10653 

HS54943 

Hs.13253 

Hs.55150 

Hs.1 12759 

Hs.155637 

Hs.55610 

Hs.82045 

Hs.105585 

Hs.193143 

Hs.250899 

Hg.5101 

Hs.25275 

Hs.226440 

Hs.75319 

Hs.14014 

Hs.172636 

Hs.7678 

Hs.71119 

Hs.187618 

Hs.92137 

Hs.12727 

Hs,72163 

Hs.191533 

Hs.79432 

Hs.69423 

Hs.299963 

Hs.2877 



Hs,98427 

Hs.125087 

Hs,72C26 

Hs.165324 

H5.9218 

Hs.271580 

Hs.3g384 

Hs.1087 

Hs.118088 

Hs.85568 

Hs.87409 

Hs.40639 

Hs.108887 

HS.287C7 

Hs.32990 

Hs.37636 

Hs.69855 

Hs.301430 

Hs.139336 

Hs.232119 



ESTs, Weakly similar to ALULHUMAN A 



ESTs, Weakly similar to CAYP_HUMAN
ESTs, Weakly similar to AF126780 1 retina
protein kinase; DNA-activated; catalytic po
solute carrier family 30 (zinc transporter), m
Midkine (neurite growth-promoting factor 2

KIAAIOeg protein
heat shock factor binding protein 1
ESTs; Highly similar to protein regulating c
Kruppel-type zinc finger protein
Homo sapiens clone 24881 mRNA sequenc
ribonucleotide reductase M2 polypeptide
ESTs, Weakly similar to KIAA0973 protein 
ESTs 



hypothetical protein FLJ21610

ESTs 

ESTs 

Fibrillin 2 (congenital contractural arachnod
ESTs

Cadherin 3, P-cadherin (placental)
ESTs

Homo sapiens cDNA FLJ11689 fis, clone H
ESTs

Homo sapiens cDNA FLJ12542 fis, clone N
ESTs 

ESTs, Weakly similar to KCC1_HUMAN C 



ESTs, Weakly similar to KIAA1392 protein 



acidic 82 kDa protein mRNA

ESTs, Moderately similar to ALU8_HUMA

thyroid hormone receptor interactor 13



PI-PLC-X,PI-PLC-Y



TGF-beta,Myc_N_term




WO 02/102235



418792 


AB037805 


Hs.88442 


KIAA1384 protein


TM


3.5


408031 


M081395 


Hs.42173


Homo sapiens cDNA FLJ10366 fis, clone N


TM


3.5


416892 


124498 


Hs^9 


growth arrest and DNA-damage-inducible,


TM


3.5


418793 


AW382987 


Hs^74 


prostaglandin-endoperoxide synthase 1 (pro


EGF


3.5


448089 


AI467945 


Hs.173696 


ESTs 


SS 


3.5


422278 


AF072873 


Hs.114218


ESTs


TM,Fz,Frizzled




442133 


AWB74138 


Hs.129017 


ESTs 


TM 


3.5


410908 


M121686 


Hs.10592 


ESTs 


GTP_EFTU 


3.5


452198 




Hs.61210 


ESTs 


TM 


3.5


408730 


AV660717 


Hs.47144 


DKFZP586N0819 protein 




3.4


436488 


BE620909 




hypothetical protein FLJ20958


TM


3.4


409745 






gb:7B14E12 Chromosome 7 Fetal Brain cD


TM


3.4


445870 


AW410053 


Hs.13406 


syntaxin 18


TM


3.4


451743 


AW074266 


Hs.23071 


ESTs 


TM 


3.4


407846 


AA426202 


HS.4&403 


Cbp/p300-interacting transactivator, with G


TM 




432350 


NM_005865 


Hs.274407 


protease, serine, 16 (thymus) 




3.4


412348 




Hs.70832 




TM 




413625 






ESTs 






417301 


AA417383 


Hs.82582 


integrin, beta-like 1 (with EGF-like repeat d




3.4


422972 


N59319 


Hs.145404 


ESTs 


TM 




429170 


NM_001394


Hs.2359


dual specificity phosphatase 4; MAP kinas


DSPc,Rhodanese


3.4


450377 


AB033091 


Hs.24936 


ESTs 


TM 


3.4


443475 


AI066470 


Hs.134482 


- ESTs 


TM 


3.4


419452 


U33635


Hs.90572


PTK7 protein tyrosine kinase 7


TM,pkinase,ig,SRF-TF


3.4 


409744 


AW675258 


Hs.56265


Homo sapiens mRNA; cDNA DKFZp586P


TM 


3.4 


422789 


AK001113 


Hs.120842


hypothetical protein FLJ10251


TM


3.4


404440 


#(NOCAT) 




0 


TM,neur_chan 


3.4


417412 


X16896


Hs.82112


interleukin 1 receptor, type 1


SS,TIR,ig


3.4


411828 


AW161449 


Hs.72290 


wingless-type MMTV integration site fami 


wnt 


3.4


417177 


NM_004458


Hs.81452


fatty-acid-Coenzyme A ligase, long-chain 4


SS 


3.4 


421013 




Hs.1345 


mutated in colorectal cancers


TM 




427072 


H38046 




gb:yp58c10.r1 Soares fetal liver spleen 1NF


Ribosomal_L22e 


3.4 


433703 


AA210863 


Hs.3532 


nemo-like kinase


pkinase


3.4 


434294 


AJ271379 


Hs.21175 


ESTs 


TM 


3.4 


444133 


AI393165 


Hs.19175 


ESTs 


TM 


3.4 


446109 


N67953 


Hs.145920 


ESTs 


TM 


3.4


400381 






0 


Asparaginase_2 


3.3 


450236 


AW162998


Hs.24684 


KIAA1376 protein 


TM 


3.3 


418836 


AI655499 


Hs.161712




TM 


3.3 


437951 


T34630 


Hs.4210 


Homo sapiens cDNA FLJ13069 fis, clone N


TM


3.3


446396 


T15767 


Hs.22452 


Homo sapiens cDNA: FLJ21084 fis, clone


TM 


3.3 


430687 


BE274217 




heterogeneous nuclear protein similar to rat




3.3


410060 




Hs.58367


glypican-4


SS


3.3


419546 


AA244199 




gbjic06c05.s1 NCLCGAP_Pr1 Homosapi 


TM 


3.3


429609 


AF002246 


Hs.210863 


cell adhesion molecule with homology to L 


TM,fn3,ig


3.3 


413289 


AA128061


Hs.114992


ESTs 


TM 


3.3 


440006 


AK000517 


Hs.6844 


hypothetical protein FLJ20510


TM 


3.3 


401435 


#(NOCAT)




0 


TM 


3.3 


420072 


AW961196 


Hs.207725


ESTs 


TM 


3.3 


421426 


AA291101 


Hs.33020 


Homo sapiens cDNA FLJ20434 fis, clone K


TM 


3.3 


425851 


NM_001490


Hs.159642


glucosaminyl (N-acetyl) transferase 1, core






443295 


AI049783 


Hs.241284 


ESTs 


TM 


3.2 


453116 


AI276680 


Hs.146086 


ESTs 


RIbosomaLLS.C 


3.2 


456546 


AI690321 


Hs.203845


ESTs, Weakly similar to TWIK-related acid


TM 


3.2 


430016 


NM_004736


Hs.227658


xenotropic and polytropic retrovirus recepto


TM 


3.2 


418281


U09550


Hs.1154


oviductal glycoprotein 1, 120kD (mucin 9,


asp,Glyco_hydro_18


3.2


433800 


AI034361


Hs.135150


lung type-I cell membrane-associated glyco


TM 


3.2 


425159 


NM_004341 


Hs.154868


carbamoyl-phosphate synthetase 2, aspartat


TM 


3.2 


428882 


AA436915 


Hs.131748 


ESTs, Moderately similar to ALU7_HUMA


carb_anhydrase 


3.2 


409533 




Hs.21291 


mitogen-activated protein kinase kinase kin






411248 


AA551538 


Hs.69321 


KIAA1359 protein 


TM 


3.2 


421379 


Y15221 


Hs.103982


small inducible cytokine subfamily B (Cys-


SS,IL8


3.2


430259 


BE550182 


Hs.127826 


KalGEF-like protein 3, mouse homolog 


TM 


3.2


414945 


BE076358 


Hs.77667


lymphocyte antigen 6 complex, locus E


SS


3.2




AB020684 


Hs.11217


KIAA0877 protein 




3.2 


421674 


T10707 


Hs.296366


neuronal PAS domain protein 2


Ribosomal_L31e


3.2


434163 


AW974720 


Hs.25206 


ESTs 


TM 


3.2 


421991 


NM_014918




KIAA0990 protein 




3.2 


409589 


AW439900


Hs.256914 


ESTs 


TM 


3.2 




BE091634 




gb:IL2-BT0731-2404OO-069-C03 BT0731 




3.2 


414661 


T97401 


Hs.21929 


ESTs 


TM 




437537 


AA758974 


Hs.121417


ESTs, Weakly similar to unnamed protein p 


TM 


3.2 


439702 


AW085525 


Hs.134182 


ESTs 




3.1 


420552 


AK000492 


Hs.98806 


hypothetical protein


TM 


3.1 




AI333660






ICE_p20,CARD




425264 


AA353953 


Hs!20369 




TM 


3.1 


422109 


S73265 


Hs.1473 


gastrin-releasing peptide


SS,Bombesin


3.1


441859 


AW194364 


Hs.128022 


ESTs, Weakly similar to FIG1 MOUSE RG


TM


3.1


415451 


H19415


Hs.268720


ESTs, Moderately similar to ALU1_HUMA


SS,Ephrin




447866 


AW444764 


Hs.211517


ESTs


homeobox


3.1


419978 


NM_001454 


Hs.93974 


forkhead box J1


Fork_head




446219 


AI287344 


Hs.149827 


ESTs 


MIP 


3.1


448428 


AF282874 


Hsi1201 


nectin 3; DKFZP566B0846 protein


TM,ig


3.1


407615 


AW753C85 




gb:PII41-CT0247-1S1299-OOS-a03 CT0247 


TM 


3.1



410516


AW976443 


Hs.285655 


ESTs 


itoGEF,PH,RhoGEF 


3.1 


418396 


AI765805




ESTs 


m 


3.1 


427855


R61253




ESTs


TM




429272 


W25140 


Hs.110667


ESTs 


TM 


3A 


450171 


AL133661 


Hs.24583 


hypothetical protein DKFZp434C0328


TM 




414774 


X02419 


Hs.77274


plasminogen activator, urokinase


SS,kringle,trypsin


3.1


422363 


T55979 


Hs.115474 


replication factor C (activator 1) 3 (38kD)




3.1 


420062 


AW411096 


Hs.94785 


hypothetical protein LOC57163




3.1 


428698 




Hs.297939 


ESTs; Weakly similar to neogenin [H.sapie 


TM 




427051 


BE178110 


Hs.173374 


ESTs 


TM 




428242 




Hs.2250 


leukemia inhibitory factor (cholinergic diffe




3.1 


452906 


BE207039 


Hs.76621 


serine (or cysteine) proteinase inhibitor, cla


TM 




429419 


ABC2322S 


Hs.202276 


KIAA1009 protein


TM 


3.1 




AF001176 


Hs.82238 


POP4 (processing of precursor, S. cerevisia


TM 




406137 


#(NOCAT) 




C 


TM 




424800 


AL035583 


Hs.153203


MyoD family inhibitor 


TM 




410262 


AW821182 


Hs.61418 


microfibrillar-associated protein 1


TM 




420392 


AI242930 


Hs.97393 


KIAA0328 protein 


SS 






AW021173 


Hs.18612 


Homo sapiens cDNA: FLJ21905 fis, clone


voltage_CLC,CBS 




429334 


D63078 


Hs.186180 


Homo sapiens cDNA: FU2303e lis, clone 


Glyco_hydro_2




449802 


AW901804 


Hs.23984 


hypothetical protein FLJ20147


TM 




450506 


NM_004460 


Hs.418 


fibroblast activation protein; alpha 


SS,Peptidase_S9


3.0 


433849 


BE465884 


Hs.280728 


ESTs 


TM 


3.0 


411984 


NM_005419 


Hs.7296e 


signal transducer and activator of transcript


SH2,STAT


3.0


422530 


AW972300 


Hs.118110


bone marrow stromal cell antigen 2


TM 


3.0 


422128 


AW881145




gb:(WOT003341Q400-182-a07 OT0033




3.0


409757 


NM_001898


Hs.123114


cystatin SN


SS,cystatin


3.0


418727 


AA227609 


Hs.94834 


ESTs 


TM 


3.0


422244 


Y08890 


Hs.113503


karyopherin (importin) beta 3


TM 


3.0 


456844 


AI264155 


Hs.152981 


CDP-diacylglycerol synthase (phosphatidat


TM 


3.0 


432358 


AI093491 


Hs.72830 


ESTs 


SS 


3.0 


416896 


AI752862 


Hs.5633 


KIAA1572 protein 


BTB 


3.0 


447312 


AI434345


Hs.36908 


activating transcription factor 1 


TM 


3.0 


445021 


AK002025 


Hs.12251 


Homo sapiens cDNA FLJ11163 fis, clone P


TM 


3.0 


422611 


AA158177 


Hs.118722


fucosyltransferase 8 (alpha (1,6) fucosyltran




3.0 


453597 


BE28113a 




myo-inositol 1-phosphate synthase A1


TM 


3.0 


401197 


#(NOCAT) 




0 


arf,Ets 


3.0 


403000 


BE247275 




U5 snRNP-specific protein, 116 kD


TM 


3.0 


410008 


AA079552




gb:2m20h125l Stratagane pancreas (93720 


TM,FG-GAP 


3.0 


413268 


AL039079 


Hs.75256 


regulator of G-protein signalling 1 


RGS 


3.0 


414080 


AA135267 


Hs.47783 


ESTs, Weakly similar to T12540 hypotheti 


TM 


3.0 


426882 


AA393108 


Hs.97365 


ESTs 


TM 


3.0


427651 


AW405731 


Hs.18493 


Homo sapiens cDNA FLJ12277 fis, clone M


TM 


3.0 




AI277652


Hs.54578


ESTs 




3.0 


433001 




Hs.279905


clone HQ0310PRO0310p1 




3.0 


444895 


AI674383 


HS.3G1192 


EST cluster (not in UniGene) 


TM,ASC 


3.0


441952 




Hs.289008 


Homo sapiens cDNA FU21314 fis, clone 




3.0


414725 


AA769791 


Hs.120355


Homo sapiens cDNA FLJ13148 fis, clone N


TM,7tm_1 


3.0 


434241 


AF119913 


Hs.2e3607 


hypothetical protein PRO3077


SS 


3.0 


424962 


NM_012288


Hs.153954


TRAM-like protein


TM 


3.0 


411987 




Hs.183380


ESTs, Moderately similar to ALU7_HUMA 


TM 


3.0 




A|WS975 


Hs.110165 


ribosomal protein L26 homolog






436481 




Hs.5199 


HSPC150 protein similar to ubiquitin-conju 


TM 


3.0


407872 


AB039723 


Hs.40735 


frizzled (Drosophila) homolog 3


TM,7tm_2,Fz,Frizzled


3.0


442577 


AA292998 


Hs.163900 


ESTs 


TM 


3.0 


416120 


H46739 




gb^oUhOZsl Soares adult brain N2b5HB5 


TM 


3.0


443775 


AF291664 


Hs.204732 


matrix metalloproteinase 26


TM,Peptidase_M10,7tm_1


3.0


414664 


AA587775 


Hs.66295


Homo sapiens HSPC311 mRNA, partial cd


TM


3.0


457590 


AI612B09 


HS5378 


spondin 1, (f-spondin) extracellular matrix


SS


3.0


418946 


AI798841 


Hs.132103 


ESTs 


TM 


3.0


457940 


AL36015g 


Hs.30445 


Homo sapiens mRNA full length insert cDN


TM,SPRY,7tm_1


3.0 



410008 
414147 
414315 
416120 
419311 
419546 
422128 
427072 



115237J 
116812.1 
142127_-1 
143512.1 



AW753035 AW753082 AW054744 AW753107 AW753C87 
AA063468 AA063018 AI444822
AA077391 AI347618 AI351453 AI088754 AW207491 AW960912 AA921874 AA286833 AA150722 BE152353
AA079552 BE142525 BE142527
BE091634

Z24878 AM94098 F13654 AA494040 AA143127
H46739 H51513 H19779

AA689591 AW974261 AA236240 AI077451 AA631399 AW974262
AA244199 AA244272 H57440

AW881145 AA490718 M85637 AA304575 T06067 AA331991
H38046 W69645 AA397968 H38047
AA431765 AA432015
AA828995 AA834879 AI926361
R36075 AI366546 R36167






PCT/US02/19297 



TABLE 15C: 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons

Pkey Ref Strand Nt_position

400881 2842777 Minus 91446-91603, 



401435 8217934 



176341-176452 



8043C-31581 
6862697 Plus 175318-175476

6007890 Plus 37999-381453652-38998,39727-39872,40557-40674,42351-42450
1054740 Plus 124361-124520,124914-125050 
9166422 Minus 30487-31058 
9256126 Minus 58313-58489 
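The Nt_position column above packs each predicted exon as a start-end pair, with several exons separated by commas. A minimal sketch of how such a field could be split into numeric ranges is given below. The function names are hypothetical, and treating both coordinates as inclusive is an assumption that the table legend does not state.

```python
def parse_exon_ranges(nt_position):
    """Split an Nt_position field such as "124361-124520,124914-125050"
    into a list of (start, end) integer pairs.

    Empty chunks (for example from a trailing comma) are skipped.
    """
    exons = []
    for chunk in nt_position.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue  # tolerate a trailing comma
        start_text, end_text = chunk.split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in {chunk!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)
```

Fields damaged during text extraction (for instance a missing comma or hyphen) will raise a `ValueError` rather than return a guessed range, which seems the safer behaviour for a table of this kind.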



Table 16A lists about 92 genes up-regulated in mucinous-type ovarian cancer compared to normal adult tissues. These were selected as for Table 14A, except that the
"average" ovarian cancer level was set to the 75th percentile amongst various mucinous-type ovarian cancers, and the tumor/normal tissue ratio was greater than or equal to 2.5.

TABLE 16A: ABOUT 92 UP-REGULATED GENES, MUCINOUS OVARIAN CANCER VERSUS NORMAL ADULT TISSUES
Pkey: Primekey
Ex.Accn: Exemplar Accession
UGID: UniGene ID

Title: UniGene title

Prot. Dom.: Predicted protein domain structure

ratio: ratio tumor vs. normal tissues

Pkey Ex.Accn UGID Title Prot. Dom. ratio


430691 


C14187 


Hs.103538


ESTs 




34.9 


432938 


T27013 


Hs.3132 


steroidogenic acute regulatory protein


START 


28.0 


418007 


M13609 


Hs.83169 


Matrix metalloprotease 1 (interstitial collag


SS,Peptidase_M10


22.3


451181 


AI796330 


Hs.207461


ESTs 




10.8 


452838 


U65011 


Hs.30743 


Preferentially expressed antigen in melanom




10.0 


407638 


AJ404672 


Hs.288693


EST




9.3


450159 


AI702416 


Hs.200771 


ESTs, Weakly similar to CAN2_HUMAN




9.2 


426890 


AA393167 


Hs.41294 


ESTs 






421156 


H87879 


Hs.102267


lysyl oxidase


SS,Lysyl_oxidase


63 


437099 


N77793 


Hs.48659 


ESTs, Highly similar to LMA1_HUMAN L


laminin_EGF 


7.6 


453866 


AW291498 


Hs.250557 


ESTs 




7.6 


435496 


AW840171 


Hs.265398 


ESTs, Weakly similar to transformation-rel




7.4 


418738 


AW388633 


Hs.6682 


solute carrier family 7, member 11




7.2 


431956 


AK002032 


Hs.272245 


Homo sapiens cDNA FLJ11170 fis, clone P


RA 


7.0 


449579 


AW207260 


Hs.134014 


prostate cancer associated protein 6 




6.7 


424586 


NM_003401


Hs.150930


X-ray repair complementing defective repa




6.7 


445891 


AW391342 


Hs.199460 




6.2 


424717 


H03754


Hs.152213


wingless-type MMTV integration site fami


wnt 


6.1 


452705 


H49805 


Hs.246005 


ESTs 




6.1 


421285 


NM_000102


Hs.1363


cytochrome P450, subfamily XVII (steroid


TM,p450


5.5


408562 


AI436323 


HSJ1141 


Homo sapiens mRNA for KIAA1568 prote




5.3 


420159 


AI572490 


Hs.99785 


ESTs 




5.3 


451105 


AI761324 




gb:wi60b11.x1 NCI_CGAP_Co16 Homo s
ESTs 




5.2 


409049 


AI423132 


Hs.146343




5.0 


448674 


W31178 


Hs.154140 


ESTs 




5.0 


423811 


AW299593 


Hs.50895 


homeobox C4




4.9 


427469 


AA403084


Hs.269347 


ESTs 






447033 


AI357412 


Hs.157601 


EST - not in UniGene


PH 


4.9 




H04607 


Hs.9218 


ESTs 




4.9 


448811 


AI590371 


Hs.174759 


ESTs 


TM 




444330 


AI597655 


Hs.49265 


ESTs 






409041 


AB033025 


Hs.50081 


KIAA1199 protein




4J 


418735 




Hs.44609 


ESTs 




45 


416661 


AA534543 


Hs.79440 


IGF-II mRNA-binding protein 3 


KH-domain 


45 


430073 


U36136 


Hs.232070 


telomerase-associated protein 1






407381 


AW072003 


Hs.40968 


heparan sulfate (glucosamine) 3-O-sulfotran






422260 


AA315993 


Hs.105484 


ESTs; Weakly similar to LITHOSTATHIN




4.4 


421110 


AJ250717 


Hs.1355 


cathepsin E






445675 


AI247763


Hs.16928


ESTs




4.2


430704 


AW813091 




gb:RC3-ST0186-240400-111-d07 ST0186 


Epimerase 


U 


414569 


AF109298 


Hs.118258 


Prostate cancer associated protein 1 




3.8


438078 


AI016377 


Hs.131693 


ESTs 




3.7 


434032 


AW009951 


HS.206B92 


ESTs 




3.7 


445657 


AW612141 


Hs.279575 


ESTs 


7tm_1 


3.6 


439759 


AL359055 


Hs.67709 


Homo sapiens mRNA full length insert cDN




3.5




BE065313 




gb:RC2-BT0318-1101 09-012-308 BT0318 




3.5


448844 


AI681619 


Hs.miS4 


ESTs 




3.5 


449048 


Z45051 


Hs.22920 


similar to S68401 (cattle) glucose induced g 


SS


3.5


438018 


AK001150


Hs.5999


hypothetical protein FLJ10298


TM 


3.4 


458123


AW892676 




gb:CM3-NN0004-280300-131-c12 NN0004 




3.4 


407385 


AA610150 


Hs.272072 


ESTs, Moderately similar to ALU7_HUMA 




3.4


424894 


H83520 


Hs.153678


reproduction 8


SS,UBX 


3.3 






424639 AI917494 Hs.131329 ESTs

414083 AL121282 Hs.257786 ESTs

426471 M22440 Hs.170009

428927 AA441837 Hs.90250 ESTs

406129 #(NOCAT) 0 TM,cNMP_binding

462699 AW295390 Hs.213062 ESTs

425842 AI587490 Hs.159623 NK-2 (Drosophila) homolog B homeobox

428976 AL037824 Hs.194695 ras homolog gene family, member 1 ras

436396 AI683487 Hs.299112 Homo sapiens cDNA FLJ11441 fis, clone H wnt

454077 AC005952 Hs.37062 insulin-like 3 (Leydig cell) SS,Insulin,pkinase

404253 #(NOCAT) 0 histone

452461 N78223 Hs.108106 transcription factor G9a,PHD

429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase domain TM

413289 AA128061 Hs.114992 ESTs

429703 T93154 Hs.28705 ESTs

407829 AA045084 Hs.29725 Homo sapiens cDNA FLJ13197 fis, clone N

424796 AW298244 Hs.293507 ESTs

424086 AI351010 Hs.102267 lysyl oxidase Lysyl_oxidase

408427 AW194270 Hs.177236 ESTs

450375 AA009647 Hs.8850 a disintegrin and metalloproteinase domain

446999 AA151520 Hs.279525 hypothetical protein PRO2605

428819 AL136623 Hs.193914 KIAA0575 gene product

422956 BE546072 Hs.122579 ESTs

428949 AA442153 Hs.104744 ESTs, Weakly similar to AF208855 1 BM4

426300 U15979 Hs.169228 delta-like homolog (Drosophila) TM,EGF

420380 AA640891 Hs.102406 ESTs

428651 AF196478 Hs.188401 annexin A10 TM,annexin

417849 AW291587 Hs.82733 Nidogen 2 EGF,ldl_recept_b

453700 AB009426 Hs.560 apolipoprotein B mRNA editing enzyme, ca TM

417975 AA641836 Hs.30085 Homo sapiens cDNA: FLJ23186 fis, clone

448756 AI739241 Hs.171480 ESTs

425087 R62424 Hs.126059 ESTs

444153 AK001610 Hs.10414 hypothetical protein FLJ10748 Kelch

443211 AI128388 Hs.143655 ESTs

415263 AA948033 Hs.130853 ESTs histone

432867 AW016936 Hs.233364 ESTs GSHPx

433639 AI278350 Hs.31409 ESTs

455386 AW935875 gb:QV3-DT0019-120100.055Hi06DT0019

419092 J05681 Hs.89603 mucin 1,

452055 AI377431 Hs.293772 ESTs



Accession: Genbank accession numbers

Pkey CAT Number Accession

430704 322217.1 AW813091 AW206665 AA484440

451105 859083.1 AI761324 AW880941 AW880937

455386 1287756 1 AW935875 BE069116 BE160251

455666 1349545.1 BE065813 BE065788 BE065889 BE065832

458123 479942.1 AW892676 AA853877 D44747



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495

Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons



2.5. 

TABLE 17A: ABOUT 183 UP-REGULATED GENES, ENDOMETRIOID OVARIAN CANCER VERSUS NORMAL ADULT TISSUES

Pkey: Primekey

Ex.Accn: Exemplar Accession

UGID: UniGene ID

Title: UniGene title

Prot. Dom.: Predicted protein domains



435094 
428153 
428187 
449034 
453102 



Ex.Accn UGID 

U65011 Hs.30743

AI550129 Hs.277523 

AW513143 Hs.98367 

AI687303 Hs.285529 
AI624049 

NM_007197 Hs.31664



Preferentially expressed antigen in melanom
EST

hypothetical protein FLJ22252 similar to SR






438817 AI023799 



Hs.179243 
Hs.163242 
mum Hs.157501 
AW514472 Hs.238415 



413623 
440901 
416661 
421478 
448706 



427121 
420610 
427356 
446577 
431118 
448112 
461106 



AA634543 
A1683243 
AW291095 



AW972870 Hs.291069
AA825721 Hs.246973 
Hs.128612 
Hs.79440 
Hs.97258 
Hs.21814 
Hs.43047 
AA828995 

AI272815 Hs.173555
AI683183 Hs.99348 
AW023482 Hs.97849 



AI672096 Hs.9012
AF0S3306 HSJ6708 
AA0837B4 HS541334 



s1NCLCGAP_Kid5Hoiiiosa 



KIAA1500 protein 

carbonic anhydrase VIII

ESTs, Weakly similar to ALUB_HUMAN



AA250737 Hs.72472 



iiscDNAFLJ12684lis,cI( 



AI613318 
AI584019 
H05840 
U60669 



AW248508 Hs.279727



Homo sapiens cDNA FLJ13352 fis, clone O
ESTs

ESTs, Moderately similar to plakophilin 2b
ESTs

cytochrome P450, subfamily XXIV (vitami
gb:xw82f0lJ!l NCI_CGAP_Pan1 Homo sa
steroidogenic acute regulatory protein



462461 
430691 
441675 



418862 
446431 
434891 
416139 
453197 
447112 
420633 



418506 
423123 
437960 



4S7979 
402606 
426471 



AB032959 Hs.161700 

N78223 Hs.108106 

C14187 Hs.103538 

AI914329 Hs.5461

NM_005401 Hs.169238

AW895503 Hs.125276

NM_005756 Hs.184942

AK001468 Hs.62180

AI797515 Hs.270560

AI266507 Hs.145689

AA948033 Hs.130853

M81883 Hs.75668

AL137578 Hs.276fl7

BE637037 Hs.273294

R46652 Hs.153486

AA814309 Hs.123583

AW975942 Hs.48624

AI916269 Hs.109057

H17800 Hs.7154

NM_014681 Hs.99626

AI741122 Hs.101810 

CI 8366 Hs,78046 

AL121282 Hs.257786 

AW9751B3 Hs.292663 

AA125985 Hs.66146 

AI566086 Hs.163716

NM_002448 Hs.1494 

AB011540 Hs.4930
AI630223 

AA084248 Hs.85339 

NM_012247 Hs.124027

AI669586 Hs.222194 

AA032279 Hs.61635 

N638SS Hs.142634 

AI970672 Hs.46638 

A14S2457 HS.U5526 

AW409872 Hs.271166 

AA776655 Hs.270942 
#(NOCAT) 

M22440 Hs.170009 



ESTs 
ESTs 

protein tyrosine phosphatase, non-receptor 
ESTs 

G protein-coupled receptor 64
ESTs 

ESTs, Moderately similar lo ALUZJUMA 



glutamate decarboxylase 1 (brain, 67kD)
Homo sapiens mRNA; cDNA DKFZp564N
sinFU20069 



ESTs, Weakly similar to ALUSJtUMAN A 

ESTs 

odorant-binding protein 2B

Homo sapiens cDNA FLJ14232 fis, clone N

tissue factor pathway inhibitor 2 TFPI2 



Band_41,Y_phosphatase



Kunitz_BPTI,&gamma



low density lipoprotein receptor-related pro
gb:adC6g08.r1 Proliferating Erythroid Cells
G protein-coupled receptor 39
SELENOPHOSPHATE SYNTHETASE ; H



11 open reading Irame 8; feta 
ESTs, Moderately similar to ALU7_HUMA






448027 
432619 
413627 
441377 
4410B5 
433527 
450171 
419807 
418867 
419335 



413415 
438965 
431041 

412140 
431707 
448816 



related to the Nteiminus Of tie 






416624 
446870 
441962 
447342 



439876 
428479 
436406 
437938 
419917 
434836 
448404 
444078 
409757 
443775 



417389 
451009 
429774 
439951 
417576 
416B06 
420900 



HS.1B1245 
Hs.133020 
Hs.24583



AL133661 
R77402 
D31771 
AW960146 



R21325 Hs.267905 

AB033052 Hs.22151 

AW444764 Hs.211517 

AA328102 Hs.24641 

U07807 Hs.194762 

H69125 Hs.133525 

NM_002407 Hs.97644
Hs.158951 

AW0B8642 Hs.97984 

AI377431 Hs.293772 

M68874 HS211587 

AF061198 Hs^1941 

R26830 Hs.106137 

N73222 Hs.21738

NM_004655 Hs.127337



420181 AI380089 



NM_014420 Hs.15 
T16971 Hs.28 
U30872 Hs.77 



U71207 Hs.29279 

AW594506 Hs.104830 

U78722 Hs.55481 

AA336399 Hs.153797 

AW450420 Hs.21335 

AA897461 Hs.158469 

AA393167 Hs.41294 
T0S477 

AL120862 Hs.124165 



L49054 Hs.85195

AW129261 Hs.250565

AI376278 Hs.100921

Y00272 Hs.184572

AW105723 Hs.126346
AI950087

AA320068 Hs.93701

AA651629 Hs.118088
BE089973 

BE246919 Hs.10290 

NM_001898 Hs.123114

AF291664 Hs.204732 

AW136934 Hs!97162 

H03754 Hs.152213 

AF151534 Hs.92023 

BE260964 Hs.82045 

AA013140 Hs.115707

AI5222t5 Hs.50883 

AI347067 Hs.124636 

AA33944S Hs.82285 

NM.fl002B8 Hs.73393 

AIJ045633 Hs.44269 

AI301740 Hs.173381 



ESTs 



JAFU12532lis,clc 



hypothetical protein DKFZp434C0328 
gb:yi75f1 1.s1 Soares placenta UHZHP Horn 
msh (Drosophila) homeo box homolog 2
Homo sapiens cDNA FLJ12888 fis, clone N
zinc finger protein 239 
ESTs 

gb:EST391 184 lUIAGE resequences, MAGP 



RAB6 interacting, kinesin-like (rabkinesin6
hypothetical protein FLJ10422
KIAA1226 protein 



ESTs; Weakly similar to WASP-family pro



dickkopf (Xenopus laevis) homolog 4

ESTs 

centromere protein F (350/400kD, mitosin)



eyes absent (Di 
ESTs 

zinc finger protein 165 

ESTs 

ESTs 

ESTs, Weakly similar to envelope protein I 
ESTs 

gb:EST03366 Fetal brain, Stratagene (cat93 
ESTs; (HSA)PAPpnDlein (programmed 08 
gb:yr77h05.s1 Soares fetal liver spleen 1NF
syntaxin 18

Homo sapiens cDNA: FLJ21814 fis, clone
ESTs; Weakly similar to !!!! ALU SUBFAM
general transcription factor IIH, polypeptid



lot(3:5Kq25,1;p34)f 
irtoALU7_HUMANA 



gb:RC6-BT0709-310300-021-G07 BT0709 
U5 snRNP-specific 40 kDa protein (hPrp8-

cystatin SN

matrix metalloproteinase 26



wingless-type MMTV integration site fami
core histone macroH2A2.2

Midkine(ne 



C2.PLA5LB 
TWSNF 

TM,7tnu2,Rho_GI 



WD40 
SS,cystatin

TM,Peptidase_M10,7tm_1







446693 


AW750373 


Hs.42315 


Homo sapiens cDNA FLJ13036 fis, clone N


407289 


AAiasisg 


Hs^49 


Homo sapiens cDNA FLJ12149 fis, clone M
0 


400882 

431322 


AW970622 




gb:EST382704 MAGE resequences, MAGK


424081 


NM_006413 


Hs.139120 


ribonuclease P (30kD)


461996 


AW514021 


Hs.245510 


ESTs 


403381 


mom) 






419488 




Hs.90691 


nucleophosmin/nucleoplasmin 3


418882 


NM_004996 


Hs.89433 


ATP-binding cassette, subfamily C (CFTR 



TM,ABC_membrane



TABLE 17B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



438993 
442438 
448404 
449034 
452771 
458861 



CAT Number 
1099611J 
1604694J 
183252_1 



467651 
542469. 
761515. 



798085J 



AW337237 AW861542 AW861655 AW858008 AW857990 AW858007
H69044 T47557 H75691 T50292
R77402 AA262462 AA250968 R03794
AW970622 AA503009 AA502998 AA602989 AA502805 T92188

AI950087 N70208 R97040 N36809 AI30B1 1 9 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598 AA261875 
AI820501 AI820532 W87891 T85904 U71455T62391 BE328571 T75102 R34725 AA884922 BE328517 AI219788 AA884444 N92578 F13493 
AA927794 AI560261 AW874068 AL1 34043 AW235363 AA663345 AW008282 AM88964 AAa3144 AI890387 Aig60344 A1741346 A1689062 
AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832 AI7526ee AA988777 AA4e8892 AI356394 AW1C3813 AI539642 
AA642789 AA8S6975 AW5Q551 2 AI9S1 530 AW629970 BE612881 AW276997 AW513601 AWS12843 AA044209 AW8S6S38 AA1 80009 AA337499 
AW9611C1 AA251 669 AA251 874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA9C8741 AW072629 AWS1 3996 
AA293273 AA969759 M75628 N22388 H84729 H60052 T92487 A1022058 AA780419 AA551005 WB0701 AW613456 AI373032 AI564269 F00531 
H83488 W371 81 W78802 R66056 AI002839 R67840 AA300a07 AW959581 T63226 F04005 
AW979074 AA834841 AA828650
AA828995 AA834879 AI926361

AA995998 AI916584 R61781 T77332 F07756 F08149 F07647

BE089973 AI498612 AW805032
AI624049 AW117770 AI858360
T05477 T07855 AI917711
AI630223 AI630470



TABLE 17C:

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons



Table 18 lists 178 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to encode proteins that are secreted into blood, lymph, or other
bodily fluids. These genes, and/or their protein products, in combination or alone, are ideal candidates for the early diagnosis of ovarian cancer. These were selected from
59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 2.4,
and that are likely to encode secreted or extracellularly-shed proteins. The "average" ovarian cancer level was set to the 90th percentile amongst various ovarian cancer
samples. The "average" normal adult tissue level was set to the 90th percentile amongst various non-malignant tissues. In order to remove gene-specific background levels of
non-specific hybridization, the 15th percentile value amongst various 149 non-malignant tissues was subtracted from both the numerator and the denominator before the ratio
was evaluated.
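The selection rule described above can be sketched in a few lines. This is a minimal illustration only: the function names are hypothetical, the linear-interpolation percentile is an assumed convention (the text does not say how percentiles were computed), and returning no ratio when the background-corrected denominator is not positive is likewise an assumption.

```python
def percentile(values, pct):
    """Return the pct-th percentile of values using linear interpolation
    between the two nearest ranks (an assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must not be empty")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_normal_ratio(tumor_levels, normal_levels,
                       tumor_pct=90.0, normal_pct=90.0, background_pct=15.0):
    """Background-corrected ratio of "average" tumor to "average" normal level.

    The background is the background_pct-th percentile of the normal
    (non-malignant) samples and is subtracted from both the numerator
    and the denominator before dividing.
    """
    background = percentile(normal_levels, background_pct)
    numerator = percentile(tumor_levels, tumor_pct) - background
    denominator = percentile(normal_levels, normal_pct) - background
    if denominator <= 0:
        return None  # assumption: the ratio is treated as undefined here
    return numerator / denominator


def is_selected(tumor_levels, normal_levels, threshold=2.4):
    """True when the corrected ratio meets the threshold quoted for Table 18."""
    ratio = tumor_normal_ratio(tumor_levels, normal_levels)
    return ratio is not None and ratio >= threshold
```

Passing `tumor_pct=75.0` and `threshold=2.5` would correspond to the percentile and cut-off quoted for Table 16A, on the assumption that the rest of that selection follows the same scheme.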

TABLE 18: ABOUT 178 UP-REGULATED GENES ENCODING SECRETED PROTEINS, OVARIAN CANCER VERSUS NORMAL ADULT TISSUES 

Pkey: Primekey

Ex. Accn: Exemplar Accession

UGID: UniGene ID

Title: UniGene title

ratio: ratio tumor vs. normal tissues



Pkey 

428579 

436982 


Ex. Accn 

NM_005756

AB018305 


UGID 
Hs.184942 


Title 

G protein-coupled receptor 64 
spondin 1, (f-spondin) extracellular mat




427585 


D31152 


Hs.179729 


collagen; type X; alpha 1 [Schmid metaph 


21.0 


423739 




Hs.97600 


ESTs 


72.1 


418007 


M13509 


Hs.83169 


Matrix metalloprotease 1 (interstitial c


20.6 




M73780 


Hs.52620 


integrin; beta 8




428664 


AK001566 




similar to SALL1 (sal (Drosophila)-like


18.5


439820 


AL360204 


Hs.283853


Homo sapiens mRNA full length insert cDN


18.5


400289 


K07820 


Hs.2258


Matrix Metalloproteinase 10 (Stromelysin


18.2


421155 


H87879 


Hs.102267 


lysyl oxidase 
ESTs 


16.1 


431989 


AW972870 


Hs.291069 


15.9 


426635 


BE395109


Hs.129327 


ESTs 


lag 


424581 


M62062 


Hs.150917 


catenin (cadherin-associated protein), a


15.7 


428976 


AL037824 


Hs.194695


ras homolog gene family, member 1 


15.1 


416209 


AA236776 


Hs.79078 


MAD2 (mitotic arrest deficient, yeast, h


15.0 


439706 


AW872527 


Hs.59761 


ESTs 


14.7 


452055 


AI377431 


Hs.293772 


ESTs 


13.2 


410102 


AW248508 


Hs.279727


ESTs; 


12.5 


428392 


H10233 


Hs.2265 


secretory granule, neuroendocrine protei 


12.4 






443715 
433496 
418601 



427344 
428479 
429782 



445413 
438078 
447342 
415138 
413478 



425371 
448706 
441031 
447207 
420440 



440870 
417866 
437960 



411571 
432158 
447020 



448276 
419926 
420736 
419790 
414343 



AI583187 Hs.9700

AF064254 Hs.49765 

AA27949C Hs.86368 

AA576953 Hs.22972 

AJ245671 Hs.12844 

NM_000869 Hs.2142 

Y00272 Hs.184672 



hypothetical protein FLJ22418
VLCS-H1 protein 



ESTs 



Homo sapiens cDNA FLJ13352 fis,
EGF-like-domain; multiple 6
5-hydroxytryptamine (serotonin) receptor
cell division cycle 2, G1 to S and G2 to
NM_005754 Hs.220689 Ras-GTPase-activating protein SH3-domain
AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines
AW072003 Hs.40968 heparan sulfate (glucosamine) 3-O-sulfot
AI45B679 Hs.181915 ESTs 



AW763676 Hs.39982 

AA151342 Hs.12677

AI016377 Hs.131693 

AI19925a Hs.19322 

C18356 Hs.78045 

U38945 Hs.1174 

W47595 Hs.169300 

W67883 Hs.137476 

NM_007358 HsJ1016 

L18964 Hs.1904 

U25128 Hs.159499 

AA101(M3 Hs.151254 

BE39394e Hs.50915 



rs; Weakly similar to!!!! ALU SUBFA 



Z45051 



Hs.64173 

Hs.22920 
Hs.288966 
D49441 Hs.155981
AW291095 Hs.21814
AI584019 Hs.169006
AA442233 Hs.17731
NM_002407 Hs.97644
AI301740 Hs.173381 




DNA-BINDING PROTEIN M96 
protein kinase C; iota
parathyroid hormone receptor 2
kallikrein 7 (chymotryptic; stratum corn

! protein 2
ESTs, Highly similar to VAB]
similar to S66401 (cattle) glucose induc
ceruloplasmin (ferroxidase)
mesothelin

class II cytokine receptor ZCYT0R7
ESTs, Moderately similar to plakophilin
hypothetical protein FLJ12892

mammaglobin 2 
dihydropyrimidinase-like 2 



IS cDNAFL 



Hs.179312 
Hs.1 584 
Hs.240 
AW969626 Hs.31704 
NM.002916 Hs.35120 
AA32006B Hs.93701 
102911 Hs.150402 
AI222279 ■ Hs.201555 
AK001380 Hs.145479 
BE616633 Hs.301122 
NM_001851 Hs.154850 
U62801 Hs.79361 
AK001676 Hs.1 2467 
AW195285 Hs.194097 
NM_0C6103 Hs.2719 
AA122393 H3.70311 
W33165 Hs.55548 
T27308 Hs.1 6986 
AI80C271 Hs.1 29445 
AA723157 Hs.73769 
Nl«_004996 Hs.89433 
NM_002214 Hs.1 84908 
AU077143 Hs.1 79565 
AA343629 Hs.104570 
R96696 Hs.35598 
U16979 Hs.169228 
AI458437 H3.177224 
NM_001333 Hs.87417 
AI091631 Hs.135501 
D86407 Hs.54481 
U30872 Hs.77204 
NHL012247 Hs.12i|027 
BE514434 Hs.20830 
AW900992 Hs.93796 
AI263022 Hs.82204 



ESTs 

collagen, type XI, alpha 1
ESTs 

a disintegrin and metalloproteinase doma

neuronal pentraxin II

ESTs, Highly similar to ti^l

small nuclear RNA activating complex, po

cartilage oligomeric matrix protein



Homo sapiens cDNA FLJ10518 fis, clone NT
bone morphogenetic protein 7 (osteogenic
collagen, type IX, alpha 1



hypothetical protein FLJ10814

ESTs

epididymis-specific; whey-acidic protein

hypothetical protein FLJ20516

ESTs, Weakly similar to unknown prot*

hypothetical protein FLJ11046

hypothetical protein FLJ12496

folate receptor 1 (adult)

ATP-binding cassette, sub-family C (CFTR

integrin, beta 8



kallikrein 8
ESTs

delta-like homolog (Drosophila)



ESTs

low density lipoprotein receptor-related
centromere protein F (350/400kD, mitosin
SELENOPHOSPHATE SYNTHETASE
synaptic Ras GTPase activating protein 1
DKFZP586D2223 protein
ESTs

glycerol-3-phosphate dehydrogenase 2 (mi
coated vesicle membrane protein
Kruppel-type zinc finger protein



NM_005940 Hs.155324 






430634 AI860651 Hs.26685 ESTs 

431846 BE019924 Hs.271580 uroplakin 1B

416658 U03272 Hs.79432 fibrillin 2 (congenital contractural ara

407792 AI077715 Hs.39384 putative secreted ligand homologous to f

420585 AW505139 Hs.279844 hypothetical protein FLJ10033

407756 AA116021 Hs.38260 ubiquitin specific protease 18

411773 NM_006799 Hs.72026 protease, serine, 21 (testisin)

421928 AF013758 Hs.109643 polyadenylate binding protein-interactin

431958 X53629 Hs.2877 cadherin 3, P-cadherin (placental)

410467 AF102546 Hs.63931 dachshund (Drosophila) homolog

418793 AW382987 Hs.88474 prostaglandin-endoperoxide synthase 1 (p

422278 AF072873 Hs.114218 ESTs

431840 AA534908 Hs.2860 POU domain, class 5, transcription facto

408730 AV660717 Hs.47144 DKFZP586N0819 protein

419452 U33635 Hs.90572 PTK7 protein tyrosine kinase 7

421841 AA908197 Hs.108850 KIAA0936 protein 



410587 U24389 Hs,55436 lysyloxi 

414774 X02419 Hs.77274 ' ' 

420552 AK000492 Hs.98806 

421991 NM 014918 Hs.11048« 

418140 BE613836 Hs.83551 

458924 BE242158 Hs.24427 DKFZP56601 646 protein 

411789 AF245505 Hs.72157 Homo sapiens mRNA; cDNA DKFZp564I19

434241 AF119913 Hs.283607 hypothetical protein PRO3077

422611 AA158177 Hs.118722 fucosyltransferase 8 (alpha (1,6) fucosy

409533 AW969543 Hs.21291 mitogen-activated protein kinase kinase

416391 AI87e927 Hs.79284 mesoderm specific transcript (mouse) hom

412804 AW978324 Hs.47144 DKFZP586N0819 protein

425851 NM_001490 Hs.159642 glucosaminyl (N-acetyl) transferase 1, c

431259 NM_006580 Hs.251391 claudin 16

418557 BE140602 Hs.245645 ESTs 

428242 H55709 Hs.2250 leukemia inhibitory factor (cholinergic 

419359 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog) 

457590 AI612809 Hs.5378 spondin 1, (f-spondin) extracellular mat

419741 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 

428330 L22524 Hs.2256 matrix metalloproteinase 7 (matrilysin, 

417315 AI080042 Hs.180450 ribosomal protein S24 

438777 AA825487 Hs.142179 ESTs, Weakly similar to ORF2 [M.musculus

442295 AI827248 Hs.224398 ESTs 

428248 AI126772 Hs.40479 ESTs 

403019 AA834626 Hs.66718 RAD54 (S.cerevisiae)-like

436252 AI539519 Hs.120969 Homo sapiens cDNA FLJ11562 fis

419488 AA316241 Hs.90691 nucleophosmin/nucleoplasmin 3

434288 AW189075 Hs.116265 ESTs

407872 AB039723 Hs.40735 frizzled (Drosophila) homolog 3

431611 U58766 Hs.264428 tissue specific transplantation antigen

443881 R64512 Hs.237146 Homo sapiens cDNA FLJ14234 fis, clone NT

453779 N3SIB7 Hs>l33BB ESTs 

433068 NI\L0064S6 Hs^lS slalyKtsnsferase 

426841 AI0S2358 Hs.193726 ESTs 

428778 AK000530 Hs.193326 fibroblast growth factor receptor-like 1 

461346 NM_006338 Hs.26312 glioma amplified on chromosome 1 protein

443883 AA114212 Hs.9930 serine (or cysteine) proteinase inhibito

420162 BE378432 Hs.95577 cyclin-dependent kinase 4

447149 BE299857 Hs.326 TAR (HIV) RNA-binding protein 2 

433656 AW974941 Hs.292385 ESTs 

408210 N8118g Hs.43104 ESTs

430651 AA961694 Hs.105187 kinesin protein 9 gene

422599 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A)

421802 BE261458 Hs.108408 CGI-78 protein

446211 AI021993 Hs.14331 S100 calcium-binding protein A13

404029 W72881 Hs.266470 protocadherin beta 2

463012 T95804 Hs.31334 putative mitochondrial outer membrane pr

419981 AA897581 Hs.128773 ESTs

448153 Y10805 Hs.20521 HMT1 (hnRNP methyltransferase, S. cerevi

419220 AA811938 Hs.291759 ESTs

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1

406860 AI624300 Hs.172928 collagen, type I, alpha 1

409893 AW247090 Hs.57101 minichromosome maintenance deficient (S.

421664 AW163267 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like

409956 AW103364 Hs.727 H.sapiens activin beta-A subunit (exon 2

407584 W25945 Hs.18745 ESTs

448796 AA147829 Hs.33193 ESTs, Highly similar to AC007228 3 BC372 



Table 19 lists about 17 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to encode proteins that are secreted into blood, lymph, or other
bodily fluids. These genes, and/or their protein products, in combination or alone, are ideal candidates for the early diagnosis of ovarian cancer. These were selected from
59680 probesets on the Affymetrix/Eos Hu03 GeneChip array such that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 2.4,
and that are likely to encode secreted or extracellularly-shed proteins. The "average" ovarian cancer level was set to the 90th percentile amongst various ovarian cancer
samples. The "average" normal adult tissue level was set to the 90th percentile amongst various non-malignant tissues. In order to remove gene-specific background levels of
non-specific hybridization, the 15th percentile value amongst various non-malignant tissues was subtracted from both the numerator and the denominator before the ratio was
evaluated.
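
The selection rule described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the function names (percentile, expression_ratio, passes_threshold) are hypothetical, the percentile is computed by linear interpolation between ranked values (the specification does not state which percentile convention was used), and the treatment of a non-positive denominator is an assumption added so the sketch does not divide by zero.

```python
def percentile(values, pct):
    """Percentile by linear interpolation between ranked values (an assumed convention)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sample is undefined")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)                        # rank is non-negative, so int() floors it
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def expression_ratio(tumor, normal, tumor_pct=90.0, normal_pct=90.0, background_pct=15.0):
    """Ratio of "average" tumor level to "average" normal level for one probeset.

    The background (a low percentile of the non-malignant samples) is subtracted
    from both the numerator and the denominator before the ratio is taken.
    """
    background = percentile(normal, background_pct)
    numerator = percentile(tumor, tumor_pct) - background
    denominator = percentile(normal, normal_pct) - background
    if denominator <= 0:
        # Assumption: the text does not say how this case was handled.
        return float("inf") if numerator > 0 else float("nan")
    return numerator / denominator


def passes_threshold(tumor, normal, cutoff=2.4):
    """True when the background-corrected ratio is greater than or equal to the cutoff."""
    return expression_ratio(tumor, normal) >= cutoff
```

For example, with the made-up intensities normal = [10, 20, 30, 40, 50] and tumor = [100, 200, 300, 400, 500], the background is 16, the background-corrected levels are 444 and 30, and the ratio is 14.8, which would pass the 2.4 cutoff.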
TABLE 19: ABOUT 17 GENES, AND COMBINATIONS THEREOF, USEFUL FOR DIAGNOSIS OF OVARIAN CANCER

UGID: UniGene ID

Title: UniGene Title

% tumors: percent of tumors detected expressing gene 
Single genes: 

UGID Title % tumors 

Hs.5378
Hs.12844
Hs.151254
Hs.97644
Hs.155981
Hs.2258

Hs.50915 kallikrein 5

Hs.301122 bone morphogenetic protein 7 (osteogenic protein 1) (BMP7)
Hs.79361 kallikrein 6 (neurosin, zyme)
Hs.83169 MMP 1 (interstitial collagenase)
Hs.72026 protease, serine, 21 (testisin)
Hs.39384 putative secreted ligand homologous to fjx1
Hs.2719 epididymis-specific; whey-acidic protein type, four-disulfide core
Hs.155324 matrix metalloproteinase 11 (stromelysin 3)
Hs.1584 cartilage oligomeric matrix protein
Hs.169300 TGF beta 2

Hs.2250 leukemia inhibitory factor (cholinergic differentiation factor)



. )in 2 + bone morphogenic protein 7 
EGF-like-domain 6 + bone morphogenic protein 7
kallikrein 7 + bone morphogenic protein 7 + testisin
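
The "% tumors" figure for a combination of genes can be illustrated with a short Python sketch. It assumes, as a reading not confirmed by the text, that a tumor counts as detected by a combination when it expresses at least one gene of that combination. The function name percent_tumors_detected and the detection calls in the usage note are hypothetical and are not data from this specification.

```python
def percent_tumors_detected(detected_by_gene, combination):
    """Percent of tumor samples expressing at least one gene in the combination.

    detected_by_gene maps a gene label to a list of booleans, one per tumor
    sample, with every list in the same sample order.
    """
    columns = [detected_by_gene[gene] for gene in combination]
    if not columns or not columns[0]:
        raise ValueError("need at least one gene and one tumor sample")
    n_samples = len(columns[0])
    if any(len(column) != n_samples for column in columns):
        raise ValueError("every gene needs one call per tumor sample")
    # A sample is a hit when any gene of the combination is detected in it.
    hits = sum(1 for calls in zip(*columns) if any(calls))
    return 100.0 * hits / n_samples
```

With four hypothetical tumor samples in which gene A is detected in samples 1 and 4, gene B in sample 2, and gene C in sample 4, gene A alone gives 50%, while the combination A + B gives 75%; adding C does not raise the figure because sample 4 is already covered.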



Table 20A lists about 1025 genes up-regulated in ovarian cancer compared to normal adult tissues. Genes associated with ovarian cancer were selected from the 53462
probesets such that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 50. The "average" ovarian cancer level was set to the
93rd percentile value amongst various ovarian cancer specimens; the "average" normal adult tissue level was set to the 96th percentile value amongst various non-malignant
tissues. In order to remove gene-specific background levels of non-specific hybridization, the 15th percentile value amongst various non-malignant tissues was subtracted from
both the numerator and the denominator before the ratio was evaluated.



Tables 20B-24B list accession numbers for Pkeys lacking UniGene IDs for Tables 20A-24A. For each probeset is listed a gene cluster number from which oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using
Clustering and Alignment Tools (DoubleTwist, Oakland, California). Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" column.



TABLE 20A: 

Pkey: Unique Eos probeset identifier number
Exemplar Accession number: Genbank accession number
UniGene ID: Unigene number
R1: Ratio of












Pkey Exemplar Accession UniGene ID Unigene Title Pred. Protein Dom. R1

42-1296 NM_002666 Hs.103253 32.5
453028 AB006532 RecQ protein-like 4 DEAD,helicase_C,Fork_head 27.6
422310 AA316622 Hs.98370 cytochrome P450, subfamily IIS SS,TM,pkinase,fn3,ig 26.5
437897 AA770561 Hs.146170 hypothetical protein FLJ22969 SS,TM,zf-DHHC 26.3
446374 AA329266 Hs.24756 ESTs, Moderately similar to al 22.6
441021 AW578716 Hs.7644 H1 histone family, member 2 22.3
409518 BE384836 Hs.3454 KIAA1821 protein 21.3
413436 AF238083 Hs.68061 sphingosine kinase 1 DAGKc 21.2
424420 BE614743 Hs.146688 prostaglandin E synthase MAPEG,SS,TM,MAPEG 20.7
422645 L40027 Hs.118890 glycogen synthase kinase 3 alp pkinase,SS,Ets 20.7
422098 H03117 Hs.111497 similar to mouse neuronal prot TM 20.2
429556 Hs.98988 ESTs SS,pkinase,PMP22_Claudin 20.1
436485 X59136 Hs.156110 immunoglobulin kappa constant SS,ig,SS 19.9
423652 AF052122 Hs.130712 Homo sapiens clone 23929 mRNA ABC1,SS,PID,PID 19.8
431773 BE409442 Hs.268557 pleckstrin homology-like domai PH,SS,LIM,Troponin
422179 AF091619 dynein, axonemal, intermediate WD40,SS 19.3
420839 AI792582 Hs.282960 hypothetical protein MGC10870 SS,DS,UPF0139,Glyco_hydro 18.5
44135S BE384361 HS.18288S ESTs, Weakly similar to JC5024 SS,TM,ank 18.5
424659 AW8gi298 Hs.331601 Homo sapiens, Similar to cyste SS,Fork_head 18.4
439924 AI9B5B97 Hs.125293 ESTs SS 18.1
458814 AI49B957 Hs.170861 ESTs, Weakly similar to Z195_H SS,TM,ldl_recept_a,ldl_re m
451643 MB4437 Hs.234799 breakpoint cluster region RhoGEF,RhoGAP,PH,C2 \12
439108 AW163034 Hs.6467 synaptogyrin 3 Synaptogyrin,SS,TM,PDZ,WD 16.9
432945 AL043683 hypothetical protein FLJ10803 m
421612
456177 
414837 
432631 
454017 
401278 
444804 
410259 
406620 
423081 
421495 
416893 
413244 



C81382 Hs.63325 

AI912498 Hs^895 

AI81C721 Hs.95424 

W33191 Hs.28907 

AF151254 Hs.106195 

NM_012391 Hs.79414 

U24266 HS.7744B 

H08379 Hs.165563 

AW023617 Hs.347130 

AI084452 Hs.22158 

AK000337 Hs.61485 

M81105 Hs.146550 

AF262992 Hs.123159 

AI583067 Hs. 1491 52 

AA455583 Hs.62406 

AW955951 Hs.159265 
M14624 

AA324251 Hs.78950 

BE390717 Hs,6074 
AW995839 

H56995 Hs,37372 

AI922821 Hs.32433 

AW613841 Hs.301394 

AA336522 Hs.12854 



hypothetical protein FLJ14996
ESTs

hypothetical protein FLJ20258

8D6 antigen

prostate epithelium-specific E
aldehyde dehydrogenase 4 famil
hypothetical protein DKFZp434N
hypothetical protein FLJ22709
Target Exon

hypothetical protein FLJ21988
hypothetical protein
myosin, heavy polypeptide 9, n
sperm associated antigen 4

hypothetical protein FLJ22573
Kruppel-related zinc finger pr
gb:Human 4-beta-galactosyltran
branched chain keto acid dehyd
similar to S. pombe dim1
gb:QV4-BN0044-110200-108-h07 B
Homo sapiens DNA binding pepti
ESTs

hypothetical protein MGC3101



SS.TM,ldLreoepLa.tiypsi 



TIUI.DnaJ.UBAAfGap.homeob 
SS,TM,myosin_head,RA,DAG_ 
Band_41 



SS,PI-PLC-X,PI-PLC-Y,C2 



432481 
447304 
421182 
407767 
456642 
437457 
430178 
430399 
436725 
410219 
442620 
439233 
425018 
423801 



428648 
432329 
417061 

417595 
426500 
433124 



AA2B4855 Hs.1(V1480 

W15398 Hs.38628 

AW451623 Hs.109752 

AA757900 Hs,270823 

AW449612 H5.152475 

AI916284 Hs.199671 

BE045223 Hs.136912 

T98225 . Hs,171952 

C00138 Hs,8535 

AA831893 Hs,292767 

BE245277 Hs,154196 

NM 015071 Hs,132942 

T85105 Hs,15471 

BE315042 Hs,19210 

F09247 Hs.247735 

BE144684 Hs.314341 

AI924533 Hs.105607 

H18283 Hs.132753 

AA285136 Hs.301914 



hypothetical protein 
putative c-Myc-responsive 
ESTs, Weakly similar lo 8656 



hypothetical protein MGC10796 

Homo sapiens mRNA for KIAA1668 
hypothetical protein FU23109 
E4F transcription factor 1 
GTPase regulator associated wi 



gb7h61gOU1 Soares placenta 
interleukin 6 receptor
: IntranafBicaudalDhainolagl 



AAsieeee hs.204918 

AB014S24 Hs.138380 

BE326786 Hs.187646 

BE395161 Hs.1390 

AF062728 Hs.188021 

NM_002962 Hs.2960 

AI675944 Hs.188691 

U10492 Hs.438 

AA424317 Hs.6259 

NM_014638 Hs.170156 

U51712 Hs.13775 

AI095087 Hs.152299 

AA853479 Hs.89890 

AW207440 Hs.186973 

U76666 Hs.69949 

BE246449 Hs.2157 



proteasome (prosome, macropain

potassium voltage-gated channe
S100 calcium-binding protein A
Homo sapiens cDNA FLJ12033 fis
mesenchyme homeo box 1
KIAA1698 protein
KIAA0450 gene product
hypothetical protein SMAP31
ESTs, Moderately similar to 85
pyruvate carboxylase
degenerative spermatocyte (hom



1^17 Hs.301338 

W03512 Hs.6479 

Atj040183 Hs.123484 

BBI)91926 Hs.16244 

AI076765 Hs.269899 
AW83S8S8 

BE182896 Hs.211193 

AW663317 Hs.SS56e 

BBSTSeO HS33719 



Target Exon
Wiskott-Aldrich syndrome (ecze
Target Exon

Homo sapiens cDNA FLJ12300 fis
ESTs, Weakly similar to Z195_H
Eos Control

hypothetical protein FLJ12587
hypothetical protein MGC13272
Homo sapiens, clone IMAGE:4178
mitotic spindle coiled-coil re
ESTs, Moderately similar to AL
gb:PM0.LT0017-031299-001-h07 L



ESTs 



SS,Conagen,COLFi,TSPN 
SS,Peplidase_C2 
SS.TopoisomeraseJ.Topois 
SS,2f-CCCH 



Seo7,PH 

SS,TM,Occludin,BIR 

SS,RNAjwl_K 

2f-C3HC4,TM,Sulfale_trans 

zf-C2H2,LiM,SS,Exo_endoj) 

I^GAP,SH3,PH 

SS,cadherlaCadheiin_C_te 

cadherin,SS,TM^adherin 

HC03_colransp,SS,TM 
F-lx)x,SaF-t)OX,HOIW^ 
UM,SS,UM 



>S,TM,Glyco_hydno_31,Glyc 



CPSase_L_chaln.PYC_OADA,H 



lSS,SS.GIyco_tranL43,COLF 

SS,Sema.pklnase,TiaPSI.e 
SS,TM,cysjich_FGFR 
Traponin,SS,glycolytlc_en 
SS,TM,BIR,UQjcon 



rnuSS.rrm 
SS 
42894B


BE5143S2 




FK506-binding protein 3 (25kD)


FKBP.PIP5K 


424707 


BH)61914 


Hs.10844 


Homo sapiens cDNA FLJ14476 fis


SS,SS,TM.Sema 


416819 


U77735 


Hs£0205 


pim-2 oncogene 


pkinase,SS,TM,OTU,K_letra 


419341 


N71463 


HS.118B88 


ESTs, Weakly similar to ALU1_H


K,TM,UPF0016 


444359 


AI697160 


Hs.143594 


ESTs, Weakly similar to HS4L_H 




404333 






C7001735*:gi|7768636|db]|BAA95 


vwd 








C12000519:gi|7710045|re(|NP_05 




467941 




Hs.14587 


ESTs, Weakly similar lo AF1518 


SS,TM,SS,TM 


401594 






NM_024817:Homo sapiens hypothe 




441790 


AW294909 


Hs.132208 


ESTs 




444008 


BE544855 


Hs.236572 


ESTs, Weakly similar to SFR4_H 


SS,SS,SAC3_GANP 


438185 


Y19188 


Hs.320461 


ESTs 


SS 


432031 


AF039196 


Hs.272367 


hairless protein (putative sin


imjC 


410471 


T88B72 




gb:yd31a12.s1 Soates fetal liv 




433573 


AF234887 


Hs.57652 


cadherin, EGF LAG seven-pass G


SS,TM,7tm_2,EGF,cadheiin, 


417371 




Hs.269149 


ESTs 






AA770021 


Hs, 16332 


ESTs 


SS,ig.fn3 




AA526235 


Hs.193162 


Homo sapiens cDNA FLJ11983 fis




412674 


X04106 


Hs,74461 


calpain 4, small subunit(30K) 


ethand.SS,CAP_GLY 


425B63 


U43604 


Hs,159901 


Human unldenlfied mRNA, parti 






NM 007274 


Hs,8679 


cytosolic acyl coenzyme A thio 


Acyl-CoA.hydro,SS,TM 


429469 


M64590 


Hs,27 


glycine dehydrogenase (decarbo


GDC-P.GDC-P 


420029 


BE258876 


Hs.94446 


polyamine-modulated factor 1 


aldo_ketred,SS,TM,gla 


445625 


BE246743 




hypctheOcal protein FIJ2263S 


88.TM 
SS.ras 


435339 
407235 


AI358300 
□20569 


Hs,169407 


ESTs 

SAC2 (suppressor of actn muta 


SS,TM,RlbosomaLS13,Galac 


428758 


M433988 


Hs.98502 


CA125 antigen; mucin 16 




401349 










437915 


AI637993 


Hs.202312 


Homo sapiens clone N1 1 NTera2D 




424511 


BE300512 


Hs,1 93557 


ESTs, Moderately similar to AL 




423366 


Z80345 


Hs.127610 




Acyl-CcA^dh>teyl-CoA^dh_M 








NM_013432*:Honio sapiens nuclea 


ank.SS,TM,CPSFJ\ 


427497 


AW139476 


Hs.31240 


ESTs 




420423 


AA827718 


Hs.88218 


ESTs 


SS 




BE270734 


Hs.2795 


lactate dehydrogenase A 


ldhJdh_C,SS,ldh 


450052 


AI631298 


Hs.236524 


ESTs 


*C3HC4,*BJbox 


412738 


N34731 


Hs.74562 


siah binding protein 1;FBP in 




444202 


AL031685 


Hs,12785 




SS,TM,NaLH^h?nger/BC2 


451165 


AI340575 


Hs.286084 


MRIP-1 protein 




411450 


H49619 


Hs.127301 


ESTs 


SS,pklnase 


405371 






NM.005569*:Homo sapiens LIM do 


pkinase,UM,PDZ 


435782 


N49433 


Hs.285737 


Homo sapiens cDNA: F1J20895 fi 


SS,G6PD,6lucosamlneLlso,6 


416866 
405474 


AA297356 


Hs.80324 


NM.0010g3':Homo sapiens acetyl 


CPSase_Lchain,biotinJip 


412837 


A1922293 


Hs.58389 


hypothetical protein UGC4090 




448133 


AA723157 


Hs.73769 


folate receptor 1 (adult) 


FolatejecSS 


431081 


AA491594 


Hs.75813 


polycystic kidney disease 1 (a 


SS,TM 


427640 


AF058293 


HS.18001S 


D-dopachrome tautomerase


MIF,late_ptoleln_L2,SS,GS 




AW938739 


Hs.115412 


hypothetical protein FLJ13881




407688 


W25317 


H6.37616 


Human 09 splice variant B mRI^A 




407507 


U73799 




gbiHuman dynactin mRNA, partia 


SS,TM,HC03jMlransp,CAP_G 


40D833 






C11000890:gi|3746443|gb|AAC639 


SS,TM,7lmJ 


422064 


AW462589 


Hs.335742 


ESTs 


TM 


452434 


D30934 


Hs.29549 


C-type lectin-like receptor-1 


leclin_c,SS,TM 


451752 


AB032997 






TBC,SS,TM,pkinase,lamlnin 


432931 


AF174487 


Hs.293753 






407893 


BE408359 


Hs.43621 


Homo sapiens, Similar to hypol 


SS,SS,arf,ras.fn3,ras 


427397 


AI929685 


Hs.1 77556 


calmodulin 1 (phosphorylase ki 


elliand,RmaAD,SS,efhand 


405159 






ENSP00000243337':CDNA FU13984 




422283 


AW411307 


Hs.1 14311 


CDC45 (cell division cycle 45, 


GDC46 


407058 


X94563 




gb;H.sapiens dbi/acbp gene exo 


SS 


448045 


AJ297436 


Hs.20165 


prostate stem cell antigen 


SS,TM,UPAR_LY6,toxin,SST 








NM 003105*:Homosaplenssortll 


!dl recepLa,fn3,ldLrece 


427315 




Hs.175553 


Homo sapiens mRNA; cDNA DKFZp5 


speclrin,SH3,PH,CH 


414391 


BE409872 




gb;601299656F1 NIH_MGC_21 Homo 




447867 


AI525268 


Hs.164303 


ESTs 


TM 






Hs.173724 


creatine kinase, brain 


ATP-gua.Ptrans.AfP-guaJ't 


454319 


AW247736 




ESTs, Weakly similar to T32527 






AF164799 


Hs!l93384 


putalative 28 kDa protein 




408645 


AW246738 


Hs.109274 


hypothetical protein MGC4365 


SS,TM 


429527 


AA454184 


Hs.289014 


ESTs 




406651 


AI559224 




gb:tq32c02x1 NCLCGAP_Ut1 Hom 




430893 


BES02068 


H5.282067 


ESTs 




414413 


BE294877 




gb:601 174162F1 N1H_MGCL17 Homo 


SS 


413726 


AJ278465 


Hs.75510 


annexin A11


annejdn,SS,annexin 


432211 


BE274530 


Hs.273333 


hypothetical protein FU10986 




421694 


BE387430 


Hs.106880 


bystin-like




453683 


AL079854 


Hs.1 18698 


Homo sapiens mRNA for KIAA1878 




456741 


W37608 


Hs.184492 


ESTs 


SS,pkinase 


442995 




Hs.288455 


Homo sapiens cDNA: FU23270 fi 




415898 


Z43379 


Hs.177193 


gb:HSC1AE111 normalized in^t 




456977 


AK000252 


Hs.169758 


hypothetical protein FtJ20245 




439632 


Ami0714 


Hs.334437 


hypothetical protein MGC4248 


SS,TM,transmembiane4 


431462 


AW583672 


Hs.256311 



400128 






Eos Control 


TM,E1-E2J\TPase,HMA.Hydro 


8.7 


438582 


AI521310 


Hs,28336S 


ESTs, Weakly similar to ALUS H 


SS 


8.7 


450958 


AL137669 


HsJ48012 


Homo sapiens mRNA; cDNA DKFZp4 




8.7 


410855 


X97795 


Hs.66718 


RAD54 (S.cerevisiae)-like


SNF2.N,hericase_C,SS 


BJ 


415126 


D60945 




gb;HUM141D04B Qontecti iiumanf 


SS,TM 


8.7 


418736 


T18979 


Hs.87908 


Snf2-related CBP activator pro


SS,helicase_C/iT_hook,SS, 


a6 


431157 


AI823959 


Hs.132678 


ESTs 


SS,MAPEG,8S.MAPEG 


8.6 


418843 


AJ251016 


Hs.89230 


potassium intermediate/small c


THCali4BD,SK_channel,TM 


a6 






Hs.94875 


ESTs, Weakly similar to A36363 




16 


432343 


NM_002950 


Hs.2961 


S100 calcium-binding protein A


S_100.SS,emand.S_100,e1t( 


6.6 


458440 


AI095463 


Hs.135254 


Homo sapiens clone 1 thrambosp 




8.6 




Y10141 




gb:H.sapiens DAT1 gene, partia 
ESTs 


SNF,SS,TM 


a6 


452851 


AW173191 


Hs.213117 


SS,Sema 


a6 


422418 


AK001383 


Hs.1 16385 


hypothetical protein FU10521 


RhoGEF 


a6 


420836 


AW958453 


Hs.204959 


hypothetical protein FU14886 


SS,ras 


8.6 


455588 


AI129903 


Hs.74669 


vesicle-associated membrane pr


synaptobtevln,SS,TM 


as 




AW972689 


Hs.200934 


ESTs 


bZIP 


as 


410720 


AF035154 


Hs.65756 


regulator of G-protein signall 


RGS,&gamma,DEP,SS,R6S,DI 


as 


449751 


AW207115 


Hs.25555 


ESTs 




as 




AW162336 


Hs.3709 


low molecular mass ubiquinone-


SS 


as 


405557 






Target Exon 


Ets,SAM_PNT 


as 


443780 


NM 012068 


Hs.9754 


activating transcription facto


bZIP.NTP Iransl 2,SS,TBC 


as 


423860 


U33291 


Hs.194301 


microtubule-associated protein 


M 


as 


421901 


AB014554 


Hs.109299 


protein tyrosine phosphatase,


8AM,SS,TM,tnn,PDZ 


8.4 


401885 






Target Exon


klnesln,SS,TM 


8.4 


449382 


A165C407 


Hs.197875 


ESTs 


8S,mn,zf^?anBP 


8.4 


432862 


AW004958 


Hs.236720 




SS,MATH.z{-TRAF^{-C3HC4 


8.4 


441363 


AW450211 


Hs.126825 


ESTs, Wealdy similar to A46302 


SS.TM,HSP20,7lin.1 


a4 


407363 


AF035032 


Hs.181125 


gb:Homo sapiens clone MCA1 L my 


SS,lg,SS,GjliLtranspepl 


a4 


425380 


AA356389 


Hs.32148 


AIW)15 protein 


SS,TM,U?R,P,PBptldase_S8 


a4 


424893 


AW296112 


Hs.153648 


Homo sapiens cDNA FU13303 fis 


SS,SAM,SS,TM,7lm_1 


8.4 


424060 


AW189983 


Hs.139119 


Homo sapiens cDNAFU10967fis 




8.3 




AL365406 


Hs.10268 


Homo sapiens mlWIiiliiengtti 




a3 


431765 


AF124249 


Hs.268541 


novel SH2-«intalning protein 1 


SH2,SS,TM 


8J 


404365 






Target Exon 


SS 


8.3 


424310 


AA338548 


Hs.50334 


testes developmenlrelated NYD 


SS.TM 


8J 


401935 






Target Exon 


PH 


8.3 


434796 


AA812046 




ESTs 


SS.myb.DNA-bMing,myb.DN 


8.3 


423093 


AA321930 


Hs.204632 






8.3 


434552 


AA639618 


Hs.325116 


Homo sapiens, clone MGC;2962, 


SS 


8.2 


457082 


AA470587 


Hs.1 04772 


ESTs 


SS 


a2 


432603 


AA554920 


Hs.1 05794 


UDP-alucose:glycoprotein gluco 


SS.TM 


8.2 


402445 






Target Exon 


fn3,SS,TM,BNR 


8.2 


422073 


AW872378 


Hs.1 201 70 


hypothetical protein FLJ21415 




8,2 


418361 


AW505368 


Hs.12460 


gb:UI-HF-BN0-alu-d-03-O-Ul.r1 




8.2 


431354 


BE046956 


Hs.251673 


DNA {cytosine-5-)-methyltransf 


SS.PWWP.PHD 


8.2 


403835 






Target Exon


TM,Suirale.lransp,STAS,HM 


8.2 


450029 


AW073380 


Hs.267963 


hypothetical protein FU10535 


SS,Pyridox_oxidase,zf-C2H 


a2 


452512 


AW363486 


Hs.337635 


ESTs 


SS 


8.2 


420138 


BE268854 


Hs.177729 


ESTs 


SS 


8.2 


439788 


N71241 


Hs.119275 


ESTs 




8.2 


423662 


AK001035 


Hs.130881 


B-cell CLUIymptioma 1 1 A (zinc 


SS 


8.2 


449656 


AA002008 


Hs.188633 


ESTs 


PIP5K 


ai 


452295 


BE379936 


HS.2B866 


programmed cell death 10


SS,serpin 


. 8.1 


448650 


AW769385 


Hs.204891 


ESTs 


SS,ILB 


ai 


446035 


NM_006558 


Hs.13565 


Sam68-likephospholyroslnepro . 




ai 


444406 


AI147237 




Immunoglobulin heavy constant 


SS 


ai 


437215 


AL1 17488 




Human clone 23564 mRNAsequenc 




ai 




NM 006577 


Hs.284284 


ESTs, Highly similar to beta-1 


SS,TM,DIX,POZ,DEP,Disheve 


ai 


400409 


AF153341 




Homo sapiens winged heWoik 




6a 


443801 


AW206942 


Hs.253594 


inlronof: Inchorhlnophalang 


GATA 


8.0 


425281 


AA444390 


Hs.155482 


hydroxyacyl glutathione hydroi 


lactamase_B,SS 


ao 


458216 


AW024282 


Hs.104938 


hypothetical protein MGC1 5906 




8.0 








C15000810*:gl|11131272|sp|P793 




8.0 


401180 






eukaryotic translation elongat 


SS,TM,kJn_ttans,lQ 


8.0 


454291 


AW334847 


Hs.213534 


ESTs, Weskly similar to MUC2.H 


SS,XRCC1_N,BRCrjactamase 


8.0 


444014 


AI095718 


Hs.135015 


ESTs 




8.0 


412128 


AW894709 




gb:CM1-NN0032-020500-212-d05 N 


SCAN,zf-C2H2,KRAB 


7.9 


408363 


NM 003389 


Hs.44396 


coronin, actin-binding protein


WD40 


7.9 


425694 


U51333 


Hs.159237 


hexokinase 3 (white cell)


hexoklnase,hexakinase2,he 


7.9 


425263 


NM 001197 


Hs.155419 


BCL2-interacting killer (apopt


SS,TM,TspOJMBR 


7,9 


447045 


AW392394 




sorting nexin 17 


SS,1F-2B.PP2C 


7.9 


457613 


AA598869 


Hs,173770 


ESTs 




7.9 


410338 


W03445 


HS.382C5 


gbsaOSgll.rl Soares melanocyt 


pMnase 


7.9 








Target Exon






454246 


AW245185 


Hs,6996 


ESTs 




73 


410079 


U94362 


Hs.58Se9 


glycogenin 2 


G|yco_transf.8 


7.9 


443678 


AW009605 


Hs.231923 


ESTs 




7.9 


404676 










7.9 


406649 


AI569392 




gb-hSjcl NCLCGAP_Ut2Hom 




7.9 


420230 


AL034344 


Hs.284186 


forkhead box C1


Forlt.head,SS,Forletiead 


7.9 


413534 


BE146961 




gb:QV441T0222-01 1 199-0194)1 2 H 


SS,TM 


7.8 


444628 


U01120 


Hs.242 


glucose-6-phosphatase, catalyt


RAP2,SS,TM 


7.8 


410839 


NM.006849 Hs.66581 


protein disulfide isomerase


thlofed,Rho_GDl,gntR,SS,T 


7.8 






444046 


AI360834 


Hs.135094 


ESTs 


SS,GTP EFTU.EFG C,GTP EFT 


439501 


AF086321 


Hs.287452 


Homo sapiens cDNA FIJ11760 fis 


TM 


415441 


R13977 


Hs.9634 


ESTs 




450461 


BE408081 


Hs.46736 


hypothetical protein FLJ23476




448993 


AI471630 




KIAA0144 gene product










Target Exon 


SS,TI^,DUF28g 


440546 


AI491994 




gb;toC7g09.x1 NCI CGAP Ut2Hom 


SS,HATPase_c 


419757 


M773820 


Hs.63970 


ESTs 


SS.TIW 


461721 


NM_006946 


Hs.26915 


spectrin, beta, non-erythrocyt


spectrin,PH,CH,SS,Peptida 


458834 


AI566883 


Hs.196446 


ESTs 




422633 


X56832 


Hs. 11 8804 


enolase 3, (beta, muscle) 


enolase,SS,TM,klnesin,FHA 


438452 


AI220911 


Hs.288959 


hypothetical protein FLJ20920


SS 


421445 


AA913059 


Hs.104433 


Homo sapiens, clone IMAGE:4054


asp,SS,TM,h)n trans,KJet 


434743 


AI353410 




ribosomal protein S18


SS,TM 


450635 


AW403954 


Hs.25237 


mesenchymal stem cell protein 


4HBT 


442394 


R62926 


Hs.285193 


ESTs 




434333 


AA186733 


Hs.292154 


stromal cell protein 




427221 


L15409 


HS.1740C7 


von Hippel-Lindau syndrome 


VHUTM 


429099 


BE439952 




phosphorylase kinase, gamma 2 


pkinase,SS,SNF2_N,helicas 


444670 


H58373 




hypothetical protein MGC3370 


SS,zf-RanBP,»4Dfl42 




AIS52833 




gb:ivb22c11.x1NCI CGAP GCOHom 


88 


444607 


AW405635 


Hs.293687 


ESTs 


SS,PI-PLC-X,PH,PI-PLC-Y,C 


449125 


AI671439 


Hs.196029 


Homo sapiens mRNA for KIAA1657 


TIMP 


447151 


AI022313 


Hs.92679 


Homo sapiens clone CDABP001 4 m 


SS,THiUJ?R.aminotran.1_2 


448626 


W27670 


Hs.55613 


hypothetical p[DlelnFU22531 




430432 


AB037758 


Hs.241419 


KIAA1337 protein 


TM,Patched,TM 


401822 






C17001422:gi|269Se66|emb|CAA7S 




428909 


AI190714 


HS.9894S 


ESTs 




414534 


BE257293 


Hs.76366 


BCL2-antagonist of cell death


SS,harmone_rec,zf-C4 


421620 


AA446183 


HS.918B5 


ESTs,WeddysiniilarlalSS214 




441660 


AI2S1960 


Hs.132545 


ESTs 


SS,TM,KOW 


442232 


AI357813 


Hs.337460 


ESTs, Wealdy similar to A47S82 


SS.TM,TGFb_propepllde,TGF 


439539 


BE348395 


Hs.121589 


ESTs 


SS,ForK.head 


400286 






C16000922:gi|7499103|plr|rr209 


TM,ABCLIran,ABC_membrane 


452833 


BE559681 


Hs.30736 


KIAA0124 protein 


WD40 


417390 




Hs.85852 


hypothetical protein MGC3169 




427721 


AI582343 


Hs.180465 


F!A023 (S. cerevisiae) homolog 


ubiquitin,UBA,lntegrin_B, 




T57758 


Hs.10255 


ESTs 






AF211976 




gb iHomo sapiens LENG9 raRI^A, pa 




413956 


AI821351 


Hs.193133 


ESTs, Weakly similar to ALU7_H 






AA829286 


Hs.332053 


serum amyloid A1 


SS,SAA_proleins,SS,SAA_pr 


406495 






Target Exon 


SRCR,TU,Acelyltransf 


430387 


AW372834 


Hs.240770 


nuclear cap binding protein su 


mn.SS,TM,rrra 


408601 


U47928 


Hs.86122 




SS,7tm.1,SS,ig,WD40,zf-UB 


424364 


AW383226 


Hs.163834 


ESTs, Weakly similar to G01763 


SS,ras 


409832 


AW963293 




gb;EST375366 l*\GE resequences, 




448043 


AI458553 


Hs.201881 


ESTs 


PHD 


421148 


AF008936 


Hs.102178 


syntaxin 16 


Syntaxin,SS,Peptidase_ll417 


420970 


AA305079 


Hs.1342 


cytochrome c oxidase subunit V


C0X5B 


419295 


BE397712 
AU)36449 


Hs.144027 


ESTs 


myb_DNA.blndlng,mybJ5NA* 


448330 
419639 


AK001S02 


Hs.91753 


hypothetical protein




431488 


AB037785 


Hs.257594 


K1AA13&4 protein 


SS,CH,U14,SS 


456487 


AF064804 




suppressor of Ty (S.cerevisiae




448615 


Aigi0868 


Hs.212957 


ESTs 


SS 


427433 


D82070 


Hs.177972 


chromosome 4 open reading fram


SSjldnase 




N49809 


H6.11197 


Homo sapiens, clone IIVIAGE:3343 




452554 


AW452434 


Hs.5e006 


ESTs,WfeaMysirtiilarloALU5_H 


SS,PAS,HLH 


411448 


AA178955 


Hs.271439 


ESTs, Weakly similar to 138022 




442318 


AI792199 




ESTs 


S8,zf-C2H2 


425055 


AW961959 


Hs.96940 


ESTs 




412935 


BE267045 


Hs.75064 


tubulin-specific chaperone c




403748 






Target Exon 


TM 


447282 


AI989963 


Hs.197505 


ESTs 




422306 


AI92B242 


Hs.293438 


ESTs, Highly similar to API 984 




416472 


AA1 80756 


Hs.340316 


ESTs, Moderately similar to AL 


zf-C2H2 


427273 


AW1 39032 


Hs.107376 


hypothetical protein DKFZp434N 


SS.SS.TM 


412265 


AA101326 


Hs.86154 


hypothetical protein FU 12457 


UPP_synlhetase,HH<G14_17 


447359 


AK002194 


Hs.19851 


peroxisomal biogenesis factor 




432747 


NM_C14404 


Hs.278907 


calcium channel, voltage-depen 


PMP22_aaudin,SS,TH<.PMP22 


406727 


AI2192B2 


Hs.2186 


eukaryotic translation elongat 




404199 






ENSP00000211797*:Helicase SKI2 


SS,RasGAP,PH,SS,PHD 




BE391590 


Hs.9265 


hypothetical protein FU20917 


SS,PWWP,Exonuclease,lipoo 


428550 


AW297880 


HS.9B661 


ESTs 


SS,homeobox,homeobox








gb:QV4-ST0212-120100-075^i10 S 


SSiATP-synLab,ATP-synt_a 


407686 




Hs.125043 


chromosome 21 open reading fra


SS,TM,ISK_Channel


418304 






gbar97g10/1 NCLC6AP_GCB1 Ho 




424263 


M77640 


Hs.1757 


L1 cell adhesion molecule (hyd


fn3,ig,IRK,SS,TM,fn3,ig,R


407581 


R48402 


Hs.173508 


P3EC8L 


SS,TM,7tm_1 


430746 


AW977370 


Hs^12 


ESTs 




402651 






NM_000721*:Homo sapiens calciu




407323 


AA181183 


Hs.143504 


gb:zp57c02.s1 Stratagene endot


^r^omal_S4e,ubiquiti 


407619 


AIJ0S0341 


Hs^165 


collagen, type IX, alpha 2 


SS,Collagen,SS,Collagen


434035 


AI762074 


Hs.204769


ESTs, Weakly similar to T28770 


SS,TM 
















AW995948 


Hs.182339


Homo sapiens pyruvate dehydrog




















AW959607 




gb:EST371677 MAGE resequences,




















A135o522 


Hs.103834






BE4G4016 


Hs.238956 














































AW27i70o 






406247 






Target Exon 












AA490544 




















Target Exon 






Hs.182313 


retinol-binding protein 2, cel




AA443443 


Hs.85524 


for muscle specific ling tinge 




AA954810 




human lionxitog of [Drosoptiila Sc 








dual specificity phosphatase 1




AA937112 


Hs.207788 






AW819273 


gb:CM2-ST0284-061299-046-a12 S 








CI 800fl090*,gil6678656|ref|NP_0 










































































BE3o4526 








AK001714 




hypothetical protein similar t












































N38970 
































































AA345241 


Hs.55950 






AI318680 




gb:ta49g09.x1 NCI_CGAP_Lu25 Ho




AA345824 


Hs.76688 


carboxylesterase 1 (monocyte/m




AB029010 


Hs.143026


KIAA1087 protein 




AA250970 


Hs.251946 


poly(A)-binding protein, cytop






Hs.22380 






AA532807 


Hs.105822














NM_001440 




exostoses (multiple)-like 3








































Homo sapiens cDNA FLJ14015 fis




























































AF131786 




Homo sapiens clone 25220 mRNA 








Homo sapiens, clone IMAGE:3610 


441794 


AW197794 




ESTs 


450287 


AW291483 


Hs.255909 


ESTs 


441523 


AW514263 




ESTs, Weakly similar to ALUF_H


452798 


AI916771 


Hs.257170 


ESTs 


451937 


API 19664 


Hs.27299 


transcriptional regulator prot


421417 


AA291004 


Hs.326088 


ESTs 


440317 


BE561888 




gb:601346093F1 NIH_MGC_8 Homo 


421321 


NM 00530 


Hs.103502 


glutamic-pyruvate transaminase


444904 


AW452054 


Hs.161139 


ESTs 


449730 


R72290 


Hs.117557


ESTs, Weakly similar to 138022



lipocalin,lipocalin,WD40

SS,TM,lg 
Rhodanese,! 
rmsushi 

SS,laminin_Nterm,laminin_
88 

disintegrin,Reprolysin,Pe

SS 
TM 

SS,cy: 
SS.TH/ 
SS 

SS,WD40,EPO_TPO 
ank,TM 



SS.perilipin 

PAF-AHJb,Upase_GOSUSS. 
E1-E2.ATPase,HI*\,Hy(iiolas 
PI3_PI4.kinase,PI3K_C2,PI 



SS,COesterase,SS,COestera 
SS,TH,Na_Ca_Ex,Calx-beta, 
SS.PABP,mupkinase,14-3- 

ssjiyi 

SS,TM,pkinase

Exostosin,SS,TM
SH2,SH3,pkinase
PHD.Virus_HS,SS,ian_clian 



TM,Reticulon

Y_phosphatase,DSPc,TM

oxklara(LqS,SS,TM,inn 



aminotranJ_ZSS,TM,lKR 
RasGAP,thyroglobulin_1,Ri




450622 


AI660285 


Hs.58210 


ESTs, Highly similar to ITH4_H




6J 


425424 


NM_004954 


Hs.157199 


ELKL motif kinase


pkinase,KA1,UBA,SS 


6.7 


435864 


AL036499 


Hs.188491


ESTs 




6.7 


410397 


AF217517 


Hs.53042 


DKFZp564J157 protein 


SS,homeobox,UPF0160,DUF23 


6.7 


454252 


AW612232 


Hs.254835 


ESTs 


SS,TM,voltage_CLC,CBS


6.7 


453023 


AW028733 


Hs.31439 


serine protease inhibitor, Kun


Kunitz_BPTI,SS,TM,ion_tra


6.6 


419157 


AA234540 


Hs.23871 


ESTs 


pkinase 


6.6 


412464 


178141 


Hs.22826 


ESTs, Weakly similar to 155214


SS,cadherin,crystall 


6.6 




AI801565 


Hs.200113 


Homo sapiens cDNA FLJ11379 fis


SS,adh_short,Transglutami




456643 


AW751497 


Hs.98370 


cytochrome P450, subfamily IIS




6.6 


411490 


R39474 




gb:yh95b09.r1 Soares placenta 


SB 


6.6 


466835 


BE153524




gb:PM0-HT0339-241199«)2-C03 H 


SS,pkinase




438857 


AI627912 


Hs.130783


Forssman synthetase


SS,RA,RasGEF,RasGEFN


6.6 


420307 


AW502869 


Hs.55219 


ESTs 


SS,TM


6.6 


453495 


AA442103 


Hs.33084 


solute carrier family 2 (facil


sugar_tr,SS,TM 




419132 


AA234822 


Hs.55147 


ESTs 


SS,TM,ion_trans,ion_trans


6.6 








Target Exon 


TM 


6.6 


433938 


AF151536 




ubiquinol-cytochrome c reducta


TM 


6.6 


448930 


AL137527 


Hs.239038 


hypothetical protein MGC4125




6.6 


454095 


AW178110 


Hs.191705


gb:IL3-HT0061-0109994)13-H04 H 


SS,TM,homeobox,trypsin,PD


6.6 


459702 


AI204995 




gb:an03c03.x1 Stratagene schiz




6.6 


422201 


NM_001505 


Hs.113207


G protein-coupled receptor 30


7tm_1,SS,TM


6.6 


406779 






CGI-39 protein; cell death-reg


SS,SS 




404149 






C6002509*:gi|5031885|rel|NP_00 


SS,TM,kringle 




418576 


AW968159 


Hs.302740 


Epithelial calcium channel 2,


SS,TM




421363 


NM_001381




docking protein 1, 62kD (downs


PH,IRS,TM,PH,IRS,trypsin,


6.6 


468919 


AI681567 


Hs.13349


KIAA0756 protein 


TM 


6.6 


427502 


AI811865 


Hs.7133 


Homo sapiens, clone IMAGE:3161


SS,TM,ABC_tran,Glyco_tran 




412289 


AW935967 


Hs.170162 


KIAA1357 protein 


SS 


6.5 


447105 


AW377610 


Hs.11123


DKFZP564G092 protein 


SS,TM 


6.5 


444672 


Z95636 


Hs.11669


laminin, alpha 5


laminin_EGF,laminin_G,EGF


6.5 


429299 


AI620463 


HSJ47408 


hypothetical protein MGC13102


SS,TM,gla 


6.5 


420003 


AA256906 


Hs.111364


ESTs, Weakly similar to ubiqui


SS,TM 


6.5 


431849 


AI670823


Hs.85573 


hypothetical protein MGC10911
hyaluronan-binding protein 2


SS,TM 


6.5 


430396 


D49742 


Hs.241363 


trypsin,kringle,EGF,SS


6.5 


437662 


AA765387 




ESTs 


WD40,RCC1,SPRY 


6.5 


436543 


NM_002212


Hs.5215 


Integrin beta 4 binding protei


eIF6


6.5 


405375 






CX00074r:gl|4885461 |reqNP_0O 


SS,TM 


6.5 


430116 


AA465350 


Hs.119400 


ESTs 


SS,TM,adh_short


6.5 


406109 






Target Exon 




6.5 


414871 


BE649179 


Hs.29008 


gb:601078714F1 NIH_MGC_12 Homo 




6.5 


440656 


AI979248 


Hs.148221 


ESTs 




6.5 


438951 


U51336 


Hs.6453 


inositol 1,3,4-triphosphate 5/


SS,oxidored.nitra,SS 


6.5 


405376 






Target Exon 


SS,TM


6.5 


426925 


NM_001196 


Hs.315689 


Homo sapiens cDNA: FLJ22373 fi






400500 






Target Exon 




6.5 


408294 


BE141732 




gb:CM)-HT01 01 -061099-032-607 H 




6.5 


447904 


AW206303 




ESTs 




6.4 


439211 


AI890347 


Hs.271923 


Homo sapiens cDNA: FLJ22785 fi




6.4 


426828 


NM_000020


Hs.172670 


activin A receptor type II-lik


pkinase,Activin_recp,SS,T


446100 


AW957109 


Hs.13804 


hypothetical protein dJ462023. 


SS,TM


6.4 


442146 


R62699 




gb:yg81g01.r1 Soares infant br


TM 




425041 


AI377150


Hs.150914 


ESTs 






457584 


AA147979 


HS.28S005 


mitochondrial import receptor


Josephin




435449 


AA682379 


Hs.303460 


EST 






406284 






Homo sapiens mRNA full length






425944 


AK000664 


Hs.164256 


hypothetical protein FLJ20657






453367 


AW32B47 


Hs.70573 


PKCI-1-related HIT protein


SS,TM




419725 


U66048 




Homo sapiens clone 161455 brea 






412452 


AA215731 




suppression of tumorigenicity 






421273 


AJ24S416 


Hs.103106 


U6 snRNA-associated Sm-like pr 


Sm.SS,tRNA«ynL1,GST_C.G 


6.4 


432746 


AA564512 


Hs.24301 


polymerase (RNA) II (DNA direc


SS,TM,EF1BD 


6.4 


429398 


AA452239 




KIAA0970 protein 




6.4








Ca000066':gl|10432395|emb|CAC1 




6.4 


427339 


AI734109


Hs.97984


SRY (sex determining region Y)




6.4 


435389 


AI811706 




CHMP1.5 protein




6.4 


423890 


AA525226 




ESTs, Moderately similar to 15




6.4 








hypothetical protein FLJ12538






414756 


AW451101 


Hs.159489 


ESTs, Moderately similar to JC 


hexokinase2,hexokinase 


6.4 




U16997 




RAR-related orphan receptor C


hormone_rec,zf-C4,SS,TM,h 


6.4


408493 


BE206854 


Hs.46039 




PGAM,BRCT,RNA_pol_L


6.4 


439569 


AW602155 


Hs.222399 


CEGP1 protein 


CUB,EGF,SS 


6.4 


457274 


AW674193 


Hs.227152 


mannan-binding lectin serine p 


SS,TM,SS,TM,Clathrin_lg_c


5.4 


444550 


BE250716 


Hs.87614 


ESTs 


SS 


6.4 








gb:yv04a07.s1 Soares fetal liv






423228 


AL137491




Homo sapiens mRNA; cDNA DKFZp4


SS,TM,sushi 


6.4


422682 


WQ5238 


HsImSI^ 


ESTs, Weakly similar to T31613 


SS,TM,DEAD,helicase CLara 


6.3


447887 


AA114050 


Hs.19949


caspase 8, apoptosis-related c


ICE_p20,DED,ICE_p10


6.3 


400137 






Eos Control 




6.3


408784 


AW971350 


Hs.63386 


ESTs 


SS 


6.3


435028 


AW193035


Hs.187370 


ESTs 




6.3 


438113 


AI467908 


Hs.8882 


ESTs 


SS,TM,7tm_1


6.3


417810 


D28419 


Hs.82609 


hydroxymethylbilane synthase
ESTs


Porphobil_deam


6.3 


436050 


AI057205


Hs.14584 




6.3






452072 
431161 
413055 



BE6223S8 Hs.61260 

N47863 Hs.180450 

AW084791 Hs.133122 

AB000824 Hs.129712

AA430651 HS.2C9249 

BE167242 Hs.47099
BE163341 

AF035959 Hs.24879 

R82342 Hs.79856 

AA436011 Hs.98187 

U53823 Hs.171952 

BE258867 Hs.27744 
AA493591 

AV6557C1 Hs.75183 

BE264649 Hs.251377 



413272 
445050 

453130 
449940 
440390 



401121 
425188 
457216 



C4001244:gi|53gg33|plc||A61275 
hypothetical protein FLJ13164
ribosomal protein S24
hypothetical protein FLJ14524
trehalase (brush-border membra



tubulin,TM
Ribosomal_S24e



AA769294 

AF086141 Hs.50760 

BE391090 Hs.280278 

AA770424 Hs.98162 

AA830402 Hs.221216 

N91071 Hs.109650 
AA491600 
AI21B431 
X13075 

AW578003 Hs.22826 

AW162263 Hs.312468 
AA127923 

AW2Q54B3 Hs.1 47260 

AA115811 Hs.6838 

AW291126 Hs.187520 

AW207385 Hs.36475

N52572 Hs.13702 

AA832417 Hs.139650

H66891 

AW862707 

AB004662 Hs.77867 
AF181286 



RAB3A, member RAS oncogene fam
gb:nh01a12.s1 NCI_CGAP_Thy1 Ho
cytochrome P450, subfamily IIE
taxol resistance associated ge
Target Exon

NM_024944*:Homo sapiens hypoth
gb:nz36g03.s1 NCI_CGAP_GCB1 Ho
Target Exon 

ESTs, Highly similar to BimL( 
EST 
ESTs 



gb:Human 2a12 mRNA for kappa-i
tropomodulin 3 (ubiquitous)
ESTs, Weakly similar to ALUC_H



ras homolog gene family, membe
Homo sapiens, clone IMAGE:3834 
KIAA0493 protein 
ESTs, Moderately similar to AL 



AK002052 Hs.155071

AA452554 Hs.283697 

BE246528 Hs.250726 

AW406289 Hs.95593 

AA0B1594 Hs.1 58311 

AF129085 Hs.25197 
AA766420 

AI868753 Hs.76372 



gb:yr71c03.r1 Soares fetal liv
G-rich RNA sequence binding fa 
adenosine A1 receptor 
Homo sapiens mutant dystrophin 
NM 001cg3*;Homo sapiens acetyl 
cullin 4B

lectin, galactoside-binding, s
C12001638*:gi|7291960|gb|AAF47
hypothetical protein FLJ11190
ESTs, Weakly similar to A41796
gb:TCBAP1D5030 Pediatric pre-B
hypothetical protein
Musashi (Drosophila) homolog 1
STIP1 homology and U-Box conta
ESTs 



412777 AI335773



448877 
413045 
408054 
417852 
410445 
415870 
438723 
441307 
406575 
401488 



451277 
421190 
401215 
408117 
426357 
418630 
400389 
447128 
431297 
431857 



protein phosphatase 5, catalyt 
ESTs 



AL138255 

AI351311 

AL135841 

AI271898 

AA651771 

W19144 

AA158243 

AW952776 

AA532576 



Hs.3076 ESTs 

Hs.271742 ADP-ribosyltransferase (NAD+; p
Hs.227729 FK506-binding protein 2 (13kD)
Hs.94943 ESTs 

ESTs, Moderately similar to AL 



SS,TM,vwa,FG-GAP,integrin



gbmeeoaiul NCI_CGAP_Ew1 Hom 



RasGAP,C2,PH,BTK

SS,trypsin,kringle,fn2,EG

SS,zf-C2H2



CPSase_L_chain,biotin_lip

SS,SS,Cullin,Cullin

Gal-bind_lectin

TM 

bZIP_Maf,SS,P5CREF1BD 



BE263928 Hs.323806 gb:601191272F1 NIH_MGC_7 Homo

AI583696 Hs.253313 ESTs 

X92121 Hs.75180 

AW816490 Hs.8102 



AA199830 gb:zq75h01.r1 St 

H15578 Hs.21017 ESTs
M3'1429 gb:Human PVT-IGLC fusion piote 

AW071696 Hs.209055 hypothetical protein FLJ14225

Target Exon

Target Exon 
AA814338 Hs.292297 ESTs 



Hs.89751 membrane-spanning 4-domains, s 
Hs.109731 gb:MRO-HT0407-180100-004-h05H 

gb:C15044 Clontech human aorta
Hs.26176 hypothetical protein FLJ10261
Hs.102482 mucin 5, subtype B, tracheobro

C12000457*:gi|7512178|pir||T30 

ESTs, Weakly similar to 138022 
Hs.12396 gb:RC3-CT0283-271099^)21-a08C 
Hs.251946 poly(A)-binding protein, cytop 

olfactory receptor, family 2, 



SS,TM 

Metallophos,TPR 



SS,TM

SS,pkinase,pkinase_C,RFX_
Glyco_hydro_1 



TM,SS,TM,death,DED 
Cys_knot,vwc
trypsin,SS,TM
SS,zf-C3HC4,BIR



SS,Patatin,ank 




430132 
459111 
405770 
415611 
453413 
424415 
426048 
435750 
439469 
445664 
418830 
452113 
449101 
437640 
400748 
442370 
442419 



431424 
442297 
457845 
446912 
451381 



433333 
450251 
408511 
414348 
456960 
412173 
404001 
446263 
441583 



457125 
430600 
421707 
436127 
414347 



411602 
433357 
404311 



hypothetical protein FLJ20647



NM_001975 Hs.146580
AI768853 Hs.134478 
AB029012 Hs.4990 



NM_002362:Homo sapiens melanom
gb:AB123C11Rlnfenlbran,LLN 
gb:AJ003294 Selected diranioscnn 



KIAA1089 protein 
gb:zd48a02.r1 Soares_fetal_hea
ESTs, Weakly similar to KIAA06



SS,SS,TM,lg 

EMP70 

lUAGE 

SS,TM,rnii,suslii 
SS,Folate_carrier
enolase,SS^tropliin-Utr 
TM 



BE513731 Hs.88959 hypothetical protein MGC4816

gb:wiii11a02j(1NCI_CGAP_Ut4Hom 

to 138022 



AI143S93 Hs.129419 

AI749893 Hs.270532 

AW750272 Hs.128608 
Z1116B 

AI222969 

NM_006202 Hs.89901 

H93040 Hs.297729 

AI347650 Hs.128521 

BE241831 Hs.172330 

AWB86484 Hs.343522 

NM_013272 Hs.14805

AW136372 Hs.1852 

NM_004605 Hs.94581

AA424313 Hs.98402 

AI016521 HS.7181G 
BE080483 

AW206404 Hs.27268 

AF041430 Hs.75922 

AF111170 Hs.306165 
T71071 



gb:H.sapiens 5HT1A receptor re



ESTs, Moderately sir 
hypothetical protein MGC270S 
ATPase,Catp " ' 
solute canter far 



Ca|^poithg,pla 
Iter family 21 [orga 



AL1382C0 

AA323968 Hs.283631 

AK001237 Hs.319088
AA216672 

BE018635 Hs.58582 
BE066529 

AW444451 Hs.134812 

AW950967 Hs.274348 

NM_014921 Hs.107054 



AW564971 Hs.259646 

AW161050 Hs.169611 

AI815474 Hs.343866

AW014231 Hs.90790

AW979197 Hs.292073 
AA776638 

W28661 Hs.5288 

L01406 Hs.767 



ESTs 
brain protein 13 

Homo sapiens 14q32 Jagged2 gen
gb:yc50b05.r1 Stratagene liver
Target Exon
KIAA1560 protein
ESTs, Weakly similar to ALUA_H
DKFZP434I1735 protein
gb:MR3-ST0220-151299-027-b10 S
gb:DKFZp547N052_r1 547 (synony
hypothetical protein DKFZp547G
hypothetical protein FLJ10375
gb:zr96e09.s1 NCI_CGAP_GCB1 Ho
Homo sapiens cDNA FLJ12789 fis
gb:RC3-BT0333-300300-017-a12 B 
ESTs 

HLA-B associated transcript-3
leotomedin-2 

RIKEN cDNA 2010100012 gene
gb;601 121639F1 NlltMGCJO Homo 

ESTs 

second mitochondria-derived ac
gb:au47f10.y1 Schneider fetal
Homo sapiens cDNA: FLJ22930 fi
ENSP00000247423':[)^'iglec prec 
ESTs, Weakly similar to ALU7_H
gb:ae78g04j1 Stratagene scliiz 
Homo sapiens mRNA; cDNA DKFZp4 
growth hormone releasing hormo
gb:EST03S28 Fetal brain, Strat 
Target Exon 



TmCDP-OtLPJransf 
actin 

7tm_1,SS,TM

SS,Peptidase_M10,hemopexin

Adaptin_N,Alpha_adaptinC2
SS,TM,ISK_Channel
SS,TM



SS,Ribosonial_t^8e 

E1-E2_ATPase,Hydrolase,Ca

kazal,OATP_N,OATP_C

SS,TU.addj)liosphat 

Sul«)trans<er3S,DA6Kc 

HECT 

homeobox,pkinase,PH,pkina

SS



SS,TM,efhand,efhand
SS,TM/iTP-synLC,Galactos 



ubiquitin,SS,THG-patcti,a 
Utropliilin,OLF,7tmJ.Gai 
CDrana_7,SS,TM 



430281 
430490 
450122 
460801 
413413 
445631 
419390 
423139 
426221 
443785 



N58172 Hs.109370 ESTs 

R34993 Hs.226666 ESTs, Moderately similar to 15 

AI633176 Hs.283865 ESTs 

AA724771 Hs.61425 ESTs 

AI878842 Hs.237924 CGI-69 protein 

AW902951 Hs.301723 Homo sapiens cDNA FLJ12974 fis

BE313765 Hs.343443 

AI739013 Hs.203348 



AK001822 

AI701162 Hs.90207 

AW402725 Hs.288560 
AB007881 

AW449952 Hs.190125 

BE250127 Hs.82906 

AW204515 Hs.156113 

NM_016558 Hs.274411

AF008216 HS.28S013 



zinc finger protein 36 (KOX 18
Homo sapiens cDNA FLJ10960 fis
hypothetical protein MGC11138
hypothetical protein FLJ21106

basic-helix-loop-helix-PAS pro
CDC20 (cell division cycle 20,
ESTs, Weakly similar to G0102S
SCAN domain-containing 1
putative human HLA class II as



SS,PHD,proteasome

SS,F5_F8Jype_Cpl(inase,E 

SS,Cl?AL_TRIO,PKi 



SS,TmY_phosptiatase,LON,A 

SS,TM,Hint,HH_signal 

SS,nni,[XIF185 

SS,TM,PMP22_Claudin,PMP22

P13_R4_i*iase,FATC,SS,TM 

WD40.SS,Tmi.fn3,EGF,fn3,lg 

SCAN 




433943 
414274 
431328 
451481 
430344 
419516 
413564 
415958 
401402 
456145 
431536 



432328 
421871 
415514 



457331 
438705 
428624 



408015 
413041 
452349 
434357 
455274 
453904 
424624 
426576 
440382 
419125 



451117 
409547 
412673 
426440 
449225 



447677 
415473 
4C8422 
442780 
461558 



404710 
447827 
448387 
419541 
449686 



AKOOOSig Hb. _ _ 

AA961613 Hs.127838 

AF152099 Hs.278911 

M992805 Hs.44865 

AW300961 Hs.334684 

AA502999 Hs.291591 

AA300228 Hs.295866 

AA476827 Hs.171012
H82550 
BE26012C 
HI 0942 

BE299427 Hs.21446 

AL133C66 Hs.341906 

U9073 Hs.198726 

AI243285 Hs.118348

AI912702 Hs.139135 

AW845318 Hs.12271 

AA19994C Hs.124039 

BE144384

AI348856 Hs.21627 

AI769392 Hs.200215 



AA218628 Hs.202977 
AI286100 

AL137506 Hs.274256 

AI572739 Hs.195471

AK001416 Hs.306122

F11301 Hs.138329

AI370379 Hs.132216

AB007867 Hs.278311 

AV647405 Hs.18443 

AI049624 Hs.283390 

AI125222 Hs.98712 

AI074951 Hs.319095 

AW379130 Hs.18953 

AW136771 Hs.244349

BE061580 Hs.61622 

AF044924 Hs.30792 

AW732284 Hs.3828 
BE151622 
AW003821 

AB032947 Hs.151301 
AA381720 

AW362152 Hs.27181 

AA642452 Hs.130881 

T87615 Hs.14716 

H55776 Hs.207915 

X89887 Hs.172350 
RQ618S 

W4413 Hs.15251 

AA015752 Hs.205173 



.105606 hypothellcal protein FU20S12 



hypothetical protein DKFZp434N
hypothetical protein FLJ22349
ATP-binding cassette, sub-fami
gb:601146990F1 NIH_MGC_19 Hom
gb:ym06c11.r1 Soares infant br
Target Exon
KIAA1716 protein
ESTs 



Target Exon 

gb:MR0-HT0166-191199-004-c11 H
gb:tb05a05.x2 NCI_CGAP_Lu26 Ho



hypothetical protein FLJ23563



Target Exon 

aldehyde dehydrogenase 8 famil
ESTs, Weakly similar to 210926
hypothetical protein DKFZp434H

ESTs 



8S,Vps26A»l-CoAjdh 



SS,TM.ABC_tranABC_raernbrB 
SS,TM 

SS,DIX,PDZ,DEP.Dishevelle 
TM,SAM_decarlx)X,SS.pkinas 
CSO,honi6obox,SS,TM,7tni_2, 
TM,ECH,chronn 

SS,SS,TM,HSFJ5NA-blnd 



SS.TM.Ribosonial S25,sugar 
pkinase,SS,TM,P2Xj-ecepto 
SS,tRNA^ynt_2b,HGTP_anti 

SS.TM.GNSUUR4,SS,TM.Rho 

PGAM,6PF2K 

TM,Steroid_dh.SS 

SS,TM 

SS,TM 

Sema,PSI,TIG.SS,TM,TIG.Se 
GTP.EFTU 

SS,E2F TDP,E2F_TDP 
SS,TM^as,MSP domain 
SS,DPPIV_N_temi 



epidermal differentiation comp
gb;MRO-BT0249-091299-201-c07 B 
hook2 protein 

mevalonate (diphospho) decarbo 
gb:PM0-HT0302-271099^)01-a08 H 
ESTs 

Ca2+-dependent activator protei
gb:EST94853 Activated T-cells 
nuclear receptor binding facto 
B-cell CLL/lymphoma 11A (zinc
ESTs 

C7000900:gl|4508027|ref|NP_003 



AW409885 Hs.335877 Homo sapiens, clone MGC:4558, 



AL042957 Hs.31845 

BE332756 Hs.169902 

R39108 Hs.6777 

BE244638 Hs.166
BE011183 

AW971345 Hs.292715 

AI419235 Hs.344456 

R39986 Hs.12778

AW977031 Hs.143554
AI017521 

NM_001089 Hs.26530

AW452791 Hs.249625

NM_014326 Hs.129208

AI539821 Hs.293799 

AW195237 Hs.7734 

AA744597 Hs.BB854 



solute carrier family 2 (facil
ESTs 

Target Exon 

sterol regulatory element bind 

gb;PM3-BN0218-100500-003^l09B 

ESTs 

gb:tf21d02.x1 NCI CGAP Bm23H 
ESTs 

ESTs, Highly similar to B45036
ESTs 

ATP-binding cassette, sub-fami 



C9CC1584:gi|7499208|pir|[T2099 



U73727 Hs.19713

AI8744Q2 Hs.292590 

AW749617 HS.2B0776 

AW072813 Hs.270868 

AA854219 HS.34B137 

AI769831 Hs.337054 

BE258332 Hs.278362 
AW452105 ESTs 

T09471 Hs.250820 hypothetical protein FLJ14827



tankyrase, TRF1-interacting an
ESTs, Moderately similar to AL
Homo sapiens, clone IMAGE:3542



bZIP,SS,AhpC-TSA
GHMP_kinases,SS,TM
SS,TM,RNA_pol_L



sugar_tr,SS,TM,sugar_tr
SS,TM,Na_sulph_symp
Ephrin 



SS,TM,7tm_1

ABC_tran,SRP54,SS,TM,ECH
SS,TM 



SS,crystall
SS

SS,TM,AAA,Ribosomal_L2






AW162225 Hs.165909 
AW832917 

AI827721 Hs.284298

BE018302 HS.2B94 



ESTs, Weakly similar to 138022 
gb;QV2-TT0003-161199Jm06T 
Homo sapiens mRNA full length 
placental growth factor, va



T70740 

AA969333 Hs.160098 ESTs



451677 
432656 
417739 

446847 



421542 

453704 
402285 



404807 
436483 
425316 
425565 
413341 



417614 
441368 
401722 
40890S 



411426 
428186 
438470 
427789 
430230 
434314 



Hs.6185 
AW270149 
AW809679 

AB020590 Hs.7782 

BE392896 Hs.129126 

N26362 Hs.11615

NM_004729 Hs.9933

AI702901 Hs.145582

AW816214 Hs.143055 

AW373583 Hs.221994 

AA059222 HsJ3538 

NM_000246 Hs.3076
Z43995 

L29472 Hs.1802 

T51454 Hs.82845 
AI798701 
AI086436 

T99264 Hs.191117 

AF281134 Hs.283741 

AA668764 Hs.301637

NM_002204 Hsi65829 



to 66 



KIAA1557prc 
ESTs, Moderi 
gb:MR4-ST0124-261D99^15-f05 S 
paraneoplastic antigen MA2 
Homo sapiens, clone MGC:10392,
map kinase phosphatase-like pr
Ac-like transposable element
ESTs, Weakly similar to F0R4M 
ESTs 



gb:HSC1QB121 normalized infant 
major histocompatibility compl
Homo sapiens cDNA: FLJ21930 fi
ESTs 

gb:ow84g06.s1 Soares_fetal_liv
ESTs 

exosome component Rrp46 
ESTs 

Integrin, alpha 3 (antigen CD4 



H23333 Hsi9002 KIAA1706 protein 

AW362198 Hs.12503 interleukin 15 receptor, alpha

AA411607 Hs.118964 ESTs, Weakly similar to KIAA11

AA719572 Hs.274441 Homo sapiens mRNA; cDNA DKFZp4



DSPc,Rhodanese,SS,TM 



SS,ArfGap,vwa,TSPN,fn3,CD
TM,ig,MHC_II_beta,SS,TM,A
SS,TM,BNR,fn3,ldl_recept_



RNaseJ'H.IWase.PK-C.SS.TG 
SS,TM 

integrin_A,FG-GAP,Rhabd_g
SS,TM,Dlhyd[DoiDlase 



AL137531
AA203473 
BE048021 
AW291446 



Hs.283010 
Hs.99010 
Hs.173084 
Hs.igi325 



H78472 

AL110255

AW015227 

AL041152 

W28548 

AA203445 

AW173212 



AV655783 Hs.661 



Hs.285050 ESTs 
Hs.325819 ESTs 
ESTs 



454151 
431627 
422379 
426765 
433325 



Target CAT 

hypothetical protein FLJ12614
gb:lL3-CT0220-1S020l)4)6&A11 C 
small GTP-binding protein 



gb:ag90e09.r1 Stratagene hNT n

Hs.334a00 hypothetical protein FU2Cg74 
gb:Homo sapiens full length in
gb:QVO-HT0101-061099-032-c04 H 

Hs.295605 mannosidase, alpha, class 2A, 

Hs.227823 plW5 protein 

Hs.48642 hypothetical protein FLJ23093 

Hs.236361 seb4D 

Hs.3797 RAB25, member RAS oncogene fam
Hs.184877 solute carrier family 25 (mito
Hs.186476 ESTs

Target Exon 
Hs.154088 hypothetical protein FLJ22756



AA932860 Hs.133864 ESTs 

AA743603 Hs.172108 nucleoporin 88kD 

AW20B986 Hs.143905 ESTs 

KIAA1033 protein

AI421251 Hs.114085 Homo sapiens mRNA for KIAA1755

AA777381 Hs.291530 ESTs, Weakly similar to ALUC_H 

AA830144 Hs.135613 ESTs, Moderately similar to 13 

AI287878 gbx|v23f06j(1 NCLCGAP.LymS Ho 

AA416856 Hs.98170 ESTs 



AV659151 
AW006831 
AA706639 

AF075083 
BE141714 
AW504300 
AW936329 
AA412428 
BE257724 
BE392921 
AW410063 



AA047169 



Target Exon 

vanilloid receptor subtype 1 
ESTs, Moderately similar to T1 
gb:EST68511 Fetal lung II Homo
ESTs, Weakly similar to T18967
Target Exon 

Homo sapiens mRNA; cDNA DKFZp5
hypothetical protein FLJ14733

ESTs 



SS,TM 

SS,TM,ras 
TM

UPF0027 

SS,TM,ank,ion_trans,SS,TM
SS,pkinase,ig

SS,asp,PGAM

SS,TM 

TM 

TM,ionJlrans,K_tetta,Rv2c 

TM,PLAT,SS

Vinculin
SS,TM,TPR

TM,synaptobrevin
SS,TM,Hint,HH_signal,tubu

filament,filament

Glyco_hydro_38,SS,TM,Pept
SS,TM 

rmi,SS,2.Hacld_DH.WD40 
ras,arf,SS 

mito_carr,SS,TM,profilin,
SS,Sulfatase

SS,TM,Glycos_transf_4



SS.TM.tubulln,E6F,F5j:8J 
SSJtansglutsmln.CTransg 
SS,SS,SNF2.N,helk:ase_C 

KRdomaln 

SS.TM,7tm-1,SS,TM 

SS.TM,DUF60,ttypsln,CUB,u 






421779 
<H»270 
418437 



400171 
431461 
402197 
449514 
442472 
409679 
439150 
412934 



AI879159 Hs.108219 wingless-type MMTV integration 

AW177805 gb:il^HT0059-180899^)07-B06H 

AA771738 HsJ48000 ESTs, Moderately simlar to AL 

BE083422 Hs^BSI hypothetical protein MGC2668

AI743177 ESTs 

NM_001327 Hs.167379 cancer/testis antigen (NY-ESO-

BE397032 Hs.14468 hypothetical protein MGC14226

AA054477 Hs.25391 ESTs 



4C1677 
405637 
450437 
408215 



411456 
446634 
453740 
426318 
416470 



401724 
438670 
414757 



416732 
404671 
433675 
426358 
456767 
412915 
443553 



BE299571 Hs.256310 likely ortholog of mouse ZFP28
Target Exon

AW970440 Hs.23642 protein predicted by clone 236 

AW806859 gb:MR0-ST0020-081 1 99-004<;03 S 

BE250521 ras homolog gene family, membe 

AF086006 gb:Homo sapiens full length in

BE011437 gb:CM4-BN0220-080500-170-f03 B

AL119470 ESTs



SS,TM 
SS,TM 

SS,TM,zf-C2H2 



3S.THATP1G1_PLM MATS.ig, 

SS.PX,arf,lipocalm,PHD,z 

SS,TM,lnos-1-P_synth,Occl 



R17258 Hs.343557 
AW449920 Hs.248855 
BE281057 Hs.184519 



U15177 HS.2C6984 

X91221 Hs.144465 

AA2S8124 Hs^7B 

T5S571 HS.1C041 

AI022287 Hs.111991

AA974434 Hs.1283S3 

BE221533 Hs.257B58 

R59946 Hs.18 

D16918 Hs.12 



X13956 Hs.24998 



AWig4601 Hs.13219 ESTs 



AK000288 Hs.18800 

AJ271378 Hs.333243 

BE547267 Hs.59791 
AW847588 

AI624849 Hs.344612 

AL120295 Hs.311809 

AA376125 Hs.147112 

N90464 Hs^23 

AL162042 Hs.272348

AB030816 HSJ6761 

AW021671 Hs.293330 

NM_004278 Hs.27008

AI275803 HS.12342B 

U46922 Hs.77252 

AW29S349 Hs.8038 

AW297844 Hs.101428 

H81066 Hs.285017 

AW977653 Hs.75319 
AA376438 

AI086412 Hs.12g064 

AW087727 Hs.74823 

AL040535 Hs.9573 
Z42737 



ESTs 

hypothetical protein FLJ12949
enoyl Coenzyme A hydratase, sh
selenoprotein X, 1
PDZ-LIM protein mystique
gb:HomasapLensDGS-e partial 
gb:ny51eiasf NCLCGAP_Pn8 Ho 
HumancosiTitdCRI-JC201SatD10 
ESTs 

ESTs, Moderately similar to ZN 
ESTs 

ESTs, Weakly similar to T33900

ESTs 

ESTs 

KIAA1553 protein 
Homo sapiens cDNA: FLJ23388 fi
BAI1-associated protein 3
Target Exon

hypothetical protein MGC10471

syntaxin 10 



ESTs 

hypothetical protein MGC13183 
gb:IL3-CT0213-161299-038-G09 C
ESTs, Weakly similar to NEL1_H
ESTs, Moderately similar to PC
Homo sapiens cDNA: FLJ22322 fi
beta tubulin 1, class VI
Homo sapiens mRNA; cDNA DKFZp7



ESTs,Weaklyslirilartop40[H 
phosphatidylinositol glycan, c
C16(W1374:gl|6755086|rel|NP 03 



hypothetical protein FLJ21799
NM_015902*:Homo sapiens proges
ribonucleotide reductase M2 po
gb:EST88856 HSC172 cells II Ho



AW799109 Hs.226755 
AW327515 

AW972553 Hs.293691 

AA992835 Hs.186776 



NM_004541:Homo sapiens NADH de 
ATP-binding cassette, sub-fami
gb:HSC0SE081 normalized infant
CI 60014ir;gi|7500345|plr)|T21 
ESTs, weakly similar ID RHOPM 
gb:MR2-Cra222-01 1 199-007-elO C 
cillaiyi 



ESTs 
ESTs 

ESTs, Highly similar to CR2J<U 



407757 
452446 
450807 
432540 
449324 
426434 
407652 
443952 
448869 



AI821517 Hs.105866 ESTs 



M17755 Hs.2041 

W27953 Hs.292911 
AI149106 

AI792798 Hs.12496 



ESTs, Weakly similar to A47582
thyroid peroxidase
ESTs, Highly similar to S60712
ESTs 

ESTs, Weakly similar to ALU4_H 



SS,TM,ank,Adap_oomp_sub 
ECH,Peplidase U7,SS,TU 
DUF25,SS,Ribosomal L3,PDZ 
LIM,SS,SH3,Sorb,Metalkip>i 



5S,TM,zf-C2H2,klnesin,Vau 



zlXCHC 

SS,TM



HECT,zf-UBR1,PABP 

SS,TM,trypsin,kringle,UPA
ABC_tran,SS 

FAI><Kldase_C,FAD_binding 



gb:wi17b08.x1 NCI_CGAP_Co16 Ho



SS,TM 




422837 
407143 
442296 
407722 
427336 
447960 



427687 
407204 
454219 
430613 
435902 
442743 
454923 
440518 
448237 
428924 
412081 

421668 
423467 
417151 
408307 
404752 
453126 
413983 



410660 
427411 
437018 
427029 
452047 
432093 
453099 
441456 
414356 
434067 
436393 
409227 
448680 
439343 



457761 
435294 
445372 

424437 
401539 
417903 
442451 
450536 
425169 
435262 
444855 
433507 
432396 



U25441 Hs.121478 dopamine receptor D3 

C14076 HSJ32329 EST 

NM_007275 Hs^186 lung cancer candidate 

BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin

NM_005658 Hs.2134 TNF receptor-associated factor

AW954377 Hs.26412 ring finger protein 26



X75C42 
AJ012008 
AA701867 



Hs.172035 hypothetical protein similar t

HS.1C6890 Homo sapiens clone 23771 mRNA 

Hs.284239 ESTs, Weakly similar to 178885

Hs.140237 ESTs, Weakly similar to ALU1_H

Hs.44313 v-rel avian reticuloendothelio

Hs.241585 66C protein 

Hs.297725 ESTs 

Hs.302110 ESTs, Weakly similar to MUC2_H 

gb:CM0-NI^0O57-16040O-335-c06 N 

Hs.233235 ESTs 

Hs.309386 ESTs 

Hs.98959 ESTs, Weakly similar to JC5314



7tnLl,SS,TM.7tra_1 
SS,TM 

SS,TM,ayco_liydro_56.Glyc 
pfkB,SS 

MATH.SS.MATH>ayi_NA2M,NT 

SS,TM,CbLN,CbLN2.CbLN3 

SS,TM 



X84048 
AK000214 
AA194055 
AI761786 



hypothetical protein FLJ21478
frequenin (Drosophila) homolog
hypothetical protein FLJ20207



NM_024778:Homo sapiens hypothe



AI571514 Hs.133022 ESTs 



AI061118

AA402242 

AA889078 

AA397596 

N35953 

H28383 

H62087 

AI453911

AW505085 

H18913 

AW022213 

AA806165 

AW245890 

AF0B6161 

AA421020 



Hs.127765
Hs.335147
Hs.124023



ESTs, Moderately similar to S6 
heparan sulfate proteoglycan 2 
nucleolar protein 3 (apoptosis 
Homo sapiens cDNA FLJ20812 fis



ESTs, Weakly similar to BOXB
gb:yl52c03.r1 Soares breast 3N
thyroid hormone receptor-assoc



H1 histone family, member X
ESTs 

W45393 HS.5S888 acl'vating transcription Mo 



i.208gi9 
Hs.109804 
Hs.215725 



Hs.126465 ESTs

Hs.4779 KIAA1150 protein

Hs.196008 Homo sapiens cDNA FLJ11723 fis

Hs.144928 ESTs

Hs.7236 eNOS interacting protein

Hs.147049 cut (Drosophila)-like 1 (CCAAT



T84084 
N36417 
AF132969 
BE244700 



NM_002342 Hs.1116 lymphotoxin beta receptor (TNF

AI498080 Hs.129616 ESTs 

AI699529 gblt17a02.x1 NCLCGAP_GC6 Horn 

AW292500 Hs.128614 ESTs

AA677088 ESTs 

BE409261 Hs.12084 Tu translation elongation fact

AI817336 Hs.191791 ESTs 

AW296956 Hs.11900 hypothetical protein FLJ14972

AA017514 Hs.6211 methyl-CpG binding domain prot 



TM,Ribosomal_S7
SS,TM,lectin_c
SS.TM.SQSJ'SY.GATA
SS,TM,Glycos_transf_4

SS,TM,GDPD,SS.TM,SH3,PDZ, 



SS,TM.aycos_transfJ 



SS,TM,ERG4_ERG24 

SS,ras

SS 

Band_41,ERM 
SS,TM 

Galactosyl_T_2

WD40,SS,TM,KOW,HLH

SS,TM,trypsin



SS,UM,SS 
HMG_box 
SS,PID,PDZ 

SS,TM,MAGE,Ribosomal_S17. 
CUT,homeobox,beta-lactama
zf-B_box,zf-C3HC4,SS 
TNFR.c6,SS 



NM_014835 Hs.15519 



GTP_EFTU,GTP_EFTU_D3,GTP_ 

pkinase 
SS 

MBD,zf-CXXC,SS,zf-CXXC 



gb:zs41b04.s1 Soares_NhHMPu_S1 TM 



TABLE 20B: 

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number






CAT Number 
10478_T 



410445 
410471 
410895 



411426 
411456
411490 
411605 
411758 
412091 
412122 
412128 
412173 
412425 



412777 
412934 
413272 

413534 
413564 
414347 
414371 



414944 
415126 
415611 



421879 
423790 
424874 



426576 
427029 
427326 



429398 
430168 
431082 
431161 
431424 
431627 
432093 



115564J 
120374 2 
120472TJ 
1226051 1 



1247426.1 
125123J 
1256751 1 
1276564J 
127838J 
1278726.1 
1280870.1 
1293726.1 
129707_1 



AL138255 BE380045 AAC47314 D82381 T18585 H6497B T10798

BE61 4290 AA307574 N35629 AA338538 AI193603 AA781096 AI680061 A1613258 AW276647 BE221263 AI348910 AI985031 AI090078 AI359617 
AA666391 Al 160210 AI446461 AI355345 AI343638 AI343640 AI275091 M78746 AW262795 AW250002 AA503756 A1934519 AW272086 N26520 

AA626639 

AW177805 AW177895 AW177816 AW177901 BE141597 BE141584 AW177822 AW177818 AW177899
BE141732 U75823 BE141331 AW178416 AW178430 BE141343 BE141298 BE141702 BE141285
S72921 NM_000614 X55889 X50542 X60477
AW327515 AW327774 AW327571
AA706639 AA064707 AL036920 AI651598 

BE250521 AA075837BE249870 AA984291 AW502442 AW501551 A1221491 AA194239 D63046AA193426 AA773243AA193293 
AW963293 AI856310AA077791 AA362540 



T88872AW749857 

AW809679AW809678AWB10113AW810182AW809900AVV8098S1AVV810110AW810228AVV810342AW810181AW809632AW80974S 
AW810372 AW809631 AW809792 AW809806 AW8104S2 AW809675 AW809964 AW810033 AW8101 1 1 AW809846 AW809847 AW809717 
AW83291 7 AW83291 3 AW832906 AW832788 AW83291 5 AW832776 



AW83S848 AW835851 AWB35852 AW835862 AW835855 AW835825 AW835847 AW83S838 
BE141714 AW845993 AW845989 
AW847588AW847715AWB47664AW847592 
R39474AW848420 R76943 

AW006831 AA678298 R12S79 W861S2 AI123683 AA699780AI6721S6 BE092SB7AA094230 AI633815 AA5261S3 W861S1 

AW860667AW860665 ' 

R06185 AW891805 AW901892 AW9018gS 

AW852707 N57282 AA725075 AI703492 AW61 21 37 A1696372 A1879394 A1653605 W26914 

AW894709 N78140 

T71071 AW902279 AW897608 

AA215731 N48523 AA307559 AA130794 BE296746 BE378386 AA190411 A1904194 AA31 1805 A1356291 AA446714 A1818924 AI609152 
AW771476 BE2981 84 AA295023 AA130708 A1078381 AA1 14156 A1198283 AA215665 AI201085 A1694848 AI077572 AA102778 AW016425 

AI923123AA577072AI671 
AI335773AI288496AA120880 

BE01 1437 BE011402 BE011395 BE011428 BE011407 BE01 1421 BE011406 

AA127923 AA127846 AA534131 NS3566AA533669 AW5112S1 AI174441 AA12787S AI68S293 AA127913 N72525 AW770386 N6g010AW070312 
H80275H80289AA972923 

BE146961 BE14S780 BE146788 BE146967 BE146774 BE146963 BE146907 



BE27S83S BE390063 BE388764 BE409101 

A1905865BE294801 BE562308BE2g7g57AW157051Al815883AW1 62529 BE439610AW1 57225 AW157210AW162675AW161998AI816168 

AW162599AI816004AI815820AW1 62158 AW162339 M17733AW157639AI879416 BE258811 AW1 57436 AW1 62433 AW161633AW162155 

AW157410AW157269AW162 

BE409872BE281460 

BE294877BE294759 

BE386764BE387560 

C15044 080943015696 



T26376F12852T75058 
Z42737 ™986 H07956 
H10942 Z42911 R60453 
AA176633 AW961842 AA309418 
H66891R9814gH68467 
Z43995R12357R34740 

AA367375 AA486701 BE152479 BE152800 AW816961 AA214097 
AA215702 AA368C06 AA215703 BE066555 BE006876 
AA235838BE1 80775 

H82550 N43802 AA243820 AL040752 N24315 U66692 

AI218431 AA432232AW183040X86012AA868831AI191788AA912999A1204297A1205744A1218259AA428596AA993742AA703660AI018669 

AA879431 



BE152393AA330984BE073904 
AA347951 AI688463AA883123 

AB007881 U32581 AW131202AW99S994 W31964 N24261 A1033045 H98694AW364848A1222031 AA907216 A1215730AA776981 AW473826 

AA376433 AA376324 AWg63848 AW834782 
AA381720 AA382040 AW963564 
AA397596 All 93827 AA435832 
AI287878AI804160AA400787 
AA402242 AAB13659 AI150316 AA412054 
AW879141 AA421 182 AI734104 AI733923 AA43060O 
AI743177 AA425743 AI804283 A1743189 

BE514362 AI879343 BE272870 BE616390 AW163444 AW161588 AW378754 AW23B803 BE267205 BE047746 BE207213 BE312782 BE266301 
BE266413 BE278348 BE2B0885 BE278833 BE281417 BE4077B6 BE378176 BE392818 AW377597 BE3g5951 BE393978 AW327483 BE3941 75 



AA452239 AI262173Aig25886AI469041 H96628AI768463 AI671422 AI91S624AA7668gi AA521087 AA814103 AWg931S1 AW005927 
AW968343 AA468507 AI478223 AW513008 AI762122 AI554512 AA862642 AA468976 
AA491600 AA491 645 AI920986 

AA493591 AA829120 AA533792 
A1222969 AA806560 AA504839 AA805261 

AW609720 AW609735 AA082757 N88831 R23418 N55837 BE549484 AW816584 AW816947 AW816897 
H28383 AW972670 H28359 AA525808 
AIJ043683 AA570698 AA907496 AL043682 AW362288 
AI085436AA579438AA579002 



36402.1 T05639AF024702 






437018 
437050 
437215 
437662 
437834 



439469 
440317 
440546 
441358 



442472 
442780 
442893 
443952 



404485_1 

41894_T 

41903J 
431333_1 
432210 1 
43473_T 
440374 1 
443674 1 



46802J 
46919J 
47274J 



543371J 
551405J 
563987_1 



AW196608 AA884617 M758108 AI126321 AA629291 AW196549 AI208031 

AI363410 AI356019 H00141 T78748AL049365Alj079911 AI750972Z42602AW452523AI223826 AA215407 AI633829AA292122 N42783 

AW505596 AF086096 N90340 N83271 AA131836 AW607273 AA527132T32316 AA421951 T34951 AW966080 M78807 N31947 AA521151 

AA278866AA044784AA700 

AA812046 AW974514 AA764999 AA649302 

AL1 19470 AA669492 AI628351 AI2S3835 AL119498 

AA677088 AI022246 AA677107 

AI3583C0 AI762981 AA678073 AA988621 

AI798701AW008826AA704731 

AI811706AW297940AJ227887AA875850AA228803AI610234AI921618AI7686C6N37039AA081104BE172693 D56503 Z28585 T95651 

AA292389AA293502 N28751 

AW022213 AI274032 AJ227898 AI16C412 AI0B4451 

AA889078 AA907263 AA742199 

AA766420 AA743319 AW976442 

AL1 17488 AL044479 

AA765387 AA832241 AI222134 AI216405AI685043 

AA769294 AW749299 AW749302 AW749295 AW749304 AW749293 AW74929B AW749294 AW749288 AW749291 AW749297 AW749292 

AW749296 AW749289 AW749287 BE535498 
AW753311 AW663081 AA773411 
M34429 M34431 M34432 M25802 AW938720 
AF075083 H52291 H52528 
AF086005 H64722 H65212 H66282 



445625 64558J 



445837 
446780 
446790 



447904 
448330 
448993 



450536 
450807 
451045 



454682 
454718 
454756 



741913J 
758690J 
79225J 



BE561888BES6061SBE562102 

AI491994AW139809AA8Ba258AI700895 

AW173212 AA983948 AI080705 AA931334 

AW51 4263 A1567908 AI299828 AI299043 N51706 AA936483 

AW197794 AW195867 AW197787 AA96B466 

R52S99T65201 F11984 F131B6 AA977679 T77028 H12167 

A1792199AI733491AA991378 

AW806859 AW806852 AF049582 

A1017521 AI017613AW511133 

H78133H90849AI023482 

A1149106AI500318AI334156AI093029A1765679AI769652A1167308AI128885 

AI147237AI800517 
BE163341 AI207755 BE171477 

BE246743 AA436942 AW024744 AW2421 77 AA975476 AW3851 85 F»7536 R73462 AV654529 T57442 A1399986 R5C073 R48743 A1769689 
AI863005 AA317806 AI678000 AW1 89963 AI986207 AW471273 R73463 AI335104 AI590161 AI469257 AI954604 H21954T25141 AA856793 

R50074AI708253AI2 

AK001822 AW860325 AA335296 AW965531 AW130957 AW193951 AI347975 AW081323 AW662527 AI343924 A1380749 AA938153 T66966 
AI655000 AW41 8837 AI380485 AA41 0698 AI520726 BE501355 AI637925 AW7792C0 AI524755 AW593995 AI336927 AI336928 A1357036 R60592 

H19058R11124T1 

AI261700AI793196AI469160AI793007 
R31107 AI341 136 AI653198 H04953 
AW452105AI341280AI917445 

AW392394 AW579531 AW382131 AAD10316 BE146145 AW579562 AW579577 BE146152 BE146040 BE145972 BE146099 AW003280 AAa68470 
BE146306 T85009 AI087294 BE146299 BE146319 BE146307 W44912 AI703134 AW026017 BE382873 AA903733 A1655933 BE551223 AA847664 

AW173582 AW57240 

AI271898 BE048502 AI452509 AI244810 X84721 AI858001 AI553937AA1 49853 H00719A1765259 AW973696 F257B7F35749AI56B815 

AW01 5380 AA554539 C00201 AAg6161 0 AW059537 R77127 
AW206303 AW207644 AI765705 
AL036449 AW016705 A1492482 

AI471630 BE54C637 BE265481 AW40771C BE513882 BES46739 AA053597 BE140503 BE218S14 AW9S6702 AI6S6234 AI636283 AI567265 
AW340858 BE207794 AA053085 R69173 AA292343 AA454908 AA293504 AI659741 A1927478 AA399460 A1760441 AA346416 BE047245 
AA730380AA394063AA454 



804806 1 AIS38706BESS0292R11026 
808345.1 A1652833A1695904AW888916 

829987_1 BE080483 BE080416 AI689298 
837848 1 AI599529 BE161564 BE077251 
847691 1 A1739262R2e418 

85673J AA215672 AI696628AA013335 H86334AA017006 

8835 1 AB032997 N74056BE467119A'A'237035A1141678AA934774AW978722AI761408 H09497AI934521AA716567 H62600AI479668 Z40632 
AA832081 Z44858 H0g496 BE396335 AW295901 BE465977 AI621269 BE465983 M79058 H62533 AA325444 

899664 J AI859393BE1 77742 
966269J AJ003294AJ003315AJ003293 
982731 1 AL138200T71830T71828 
986581.1 AWG03821 AW027475AW025661 

120132.1 AA224053 AA114150AA214275AA224027 T58431 AA211908 AA669657AA199744AA630511 AA164a64T58463AA214394 AA161378 

AA161386AA205211AA167824AA084940AA223625AA191190AA309486AW961804 
1206827.1 AW752781 BE1 43749 AW752727AW752559AW52578AW7525B4R45742 

1225673 1 AW809272 AW809169 AW809179 AW809192 AW809166 AW809172 AW809191 AW809165 AW809197 AW809181 AWB09237 AW809226 
AW809250 AW809199 AW809259 AW809239 AW809273 AW809270 AW809147 AW8C9188 AW809245 
AW816029AW813292AW816156AW813333AW816159AW813302AW813344AW813172 

AW81 51 44 AW81 51 50 AW861 007 

AW819273AW819283AW819287AW8ig281 AW819274AW819282AW819277AW819286 
AW897236AW845405 

AW851734AW851676AW851593AW851713AW851722AW851616AW851731AW851618AW851648AW852215 
BE1S1622 BE1S1636 AW88S648 
BE144384 AW887474 AW887403 BE144386 
AW99S839AW99S907 

BE01 1183 BE011170 BE011333 BEC111BB BE01 1181 BEO1 1324 BE011161 BE01 1 169 



1228976.1 
1230532 1 
1233646.1 
1245024 1 
1249762.1 
1272212.1 
1273576.1 
1325974.1 
1337197.1 
1349914.1 
1364506.1 
AF064804 AA320309 N89343 AA564588 AF069734 AA3492'I8 AW964366 T98641 AW51 1 100 T98542 AW070452 AAD131 72 AI767005 T321 40
W05727T30969T30970 N74883AA903211 AI392796AI434622 AA829283 

AA744389AA744270AA744284AA744299AA745380AA744337AA845905AA847e98 - ■ 

AA776638BE439540 

AI2861 00 AA952934 AA918305 

AW27C149 AW66462B AI285912 



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22," Dunham, et al. (1999) Nature 402:489-495
Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons
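For readers working with this table in electronic form, the following minimal sketch shows one way to turn an Nt_position entry into numeric exon intervals. It assumes that each entry is a comma-separated list of start-end pairs and that both coordinates are inclusive; neither assumption is stated in the legend, and the function names `parse_nt_position` and `total_exon_length` are hypothetical.

```python
def parse_nt_position(field):
    """Split an Nt_position entry into a list of (start, end) integer pairs.

    Assumes a comma-separated list of "start-end" ranges, e.g.
    "172961-173056,173868-173928". Malformed ranges raise ValueError so
    that OCR-damaged entries are flagged instead of silently accepted.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, sep, end_text = part.partition("-")
        if not sep or not start_text.isdigit() or not end_text.isdigit():
            raise ValueError(f"malformed exon range: {part!r}")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons):
    """Sum exon lengths, treating both coordinates as inclusive (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_nt_position("172961-173056,173868-173928")
    print(exons, total_exon_length(exons))
```

Entries that were damaged during text extraction (for example, a range missing its hyphen) are rejected by the parser, which makes them easy to locate for manual correction.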



Pkey 



Ref 



4C0500 9796136 

4C052B 6981824 

4CC668 8118719 

40C748 8119063 

400762 8131616 

400772 8131629 

400833 8705148 



400923 
401121 
401180 
401203 
401210 
401215 
401264 
401278 
401349 
401402 
401488 



401677 
401722 
401724 
401822 
401885 
401935 
401938 
401984 
402189 
402197 



403003 
403128 
403672 

403748 



404430 
404438 
404571 
404596 
404676 
404710 
404752 
404807 



9438648 
9743387 
7712287 
9858408 
9797154 
9799936 
9930791 
7710964 
7341775 
7534110 
8072433 
8099284 
7230963 

9965537 



6102666 
4454511 
8576043 



9796614 
9797862 
9838114 



Nt_position



472381-472528,474170474277,475328475542,4758784760 

121756-122043 

84237-84398 

7235-7605 

34896-35021,4107841197 
187599-188138 



57211-57525 

172961-173056,173868-173928 

166969.167133,169760-169877,171563-171733 

103739-103919 

130810-130927,133367-133504 
98428-98573 
72440-73030 
75730-76077 



71055-71259 



191 57-139298.139440-139599,139960-140159 



70928-71185 

90925-91064,91172-91331 

8601-8876 

48547-48678,50604-50737,51384-51467 

174215-174380 

361474541-687 

79403-79560,79712-80021 

122884-123018.123134-123283,123372-123695123779-123940,124059-124256 

96600-96881,96951-97280,97393-97594 

129503-130344 



48636-48822 
137996-138317 
71040-71288 

121831-121951,124044-124150 

1669-2740 

149189-149303 

137948-138024,138111-138300 

50151-50319,50859-51098 

42921-43109 

63413-63553 

112450-112648 

104807-105043 

56167-56342,58066-58189,58891-59048,60452-60628 

45190-45339,4750947622,4813748264,4880548946,50073-50345.51467-51588 

120168-120326 

124246-124422 
6692345 Minus



7717155 Minus



65578-65778,68 
101982-102171 
22478-22632 
47657-47766,48461-48596 
11646-12060,12207-12485 



B8-63234 



153074-153343,154501-154598,156879-156999,158863-159051,159910-160053,161109-161229.183035-163131,16 

165259,165868-166003,167375-167552,169252-169364,171127-171281 

172005-172175 



9256130 
7711328 
7711579 



71716-72515 
74002-74199 

35655-36119 

57291-57494 

188922-189152 

174661-174978 

142034-142473 



Table 21A lists about 933 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to encode extracellular or cell-surface proteins. These were
selected as for Table 20A, except that the ratio of 'average' ovarian cancer to 'average' normal adult tissues was greater than or equal to 3.0, the 'average' ovarian cancer level
was set to the 96th percentile value amongst various ovarian cancer specimens, the 'average' normal adult tissue level was set to the 75th percentile value amongst various
non-malignant tissues, the 'average' ovarian cancer value was greater than or equal to 400 units (this selects for the most abundant of the up-regulated genes), and the predicted
protein contained a structural domain that is indicative of extracellular localization (e.g., Ig, fn3, EGF, 7tm domains, signal sequences, transmembrane domains). The predicted
protein domains are noted.
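The selection criteria stated in the preceding paragraph can be illustrated with the following minimal sketch. It covers only the criteria named in that paragraph, not any further criteria inherited from Table 20A. The percentile interpolation method, the `Probeset` structure, the function names, and the particular set of domain labels are assumptions made for illustration and are not taken from the specification.

```python
from dataclasses import dataclass
from typing import Sequence

# Illustrative set of domain labels treated as indicating extracellular or
# cell-surface localization (signal sequence, transmembrane, Ig, fn3, 7tm).
# The exact label set is an assumption.
EXTRACELLULAR_DOMAINS = frozenset({"SS", "TM", "ig", "fn3", "7tm_1"})


def percentile(values, pct):
    """Percentile by linear interpolation (assumed; the text names no method)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile of an empty sample is undefined")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


@dataclass
class Probeset:
    pkey: int
    tumor_values: Sequence[float]   # expression in ovarian cancer specimens
    normal_values: Sequence[float]  # expression in non-malignant adult tissues
    domains: Sequence[str]          # predicted protein domains


def passes_table_21a_filter(p, min_ratio=3.0, min_tumor_level=400.0):
    """Apply the three stated criteria: abundance, tumor/normal ratio, domain."""
    tumor_level = percentile(p.tumor_values, 96)    # 'average' cancer level
    normal_level = percentile(p.normal_values, 75)  # 'average' normal level
    abundant = tumor_level >= min_tumor_level
    # Multiplying instead of dividing avoids a zero denominator.
    enriched = tumor_level >= min_ratio * normal_level
    extracellular = any(d in EXTRACELLULAR_DOMAINS for d in p.domains)
    return abundant and enriched and extracellular


if __name__ == "__main__":
    example = Probeset(1, [500.0] * 5, [100.0] * 5, ("SS", "TM"))
    print(passes_table_21a_filter(example))
```

Comparing the tumor level against `min_ratio * normal_level` rather than computing a quotient is a design choice of this sketch; the R1 column of the table reports the ratio itself.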

TABLE 21A: 

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UGID: Unigene number

Title: Unigene gene title

Pred. Protein Dom.: Predicted protein domain

R1: Ratio of tumor to normal body tissue



410418 
431773 
428758 
438424 
450461 
437897 
452554 
422310 
452849 
407722 
416819 
430397 
427725 
454017 
445434 



407117 
402916 
425760 
422098 
442232 



AF044924 
BE252241 
Ur7735 
Aig24533 
U66839 



AJ278466 
U51336 
BE439952 



AI357813 
AF077036 
AF091619 



UGID 

Hs.103253 

Hs.237924 

Hs.63325 

Hs.258557 

Hs.98502 

Hs.25895 

Hs.46736 

Hs.146170 

Hs.53006 

Hs.98370 

Hs.30792 

Hs.38041 

Hs.30205 

Hs.105607

Hs.180533 

Hs.347130 

Hs.9265 

Hs.29344 

Hs.90535 

Hs.146688 

Hs.43621 

Hs.74451 

Hs.227729 

Hs.11669 

Hs.75510 

Hs.6453 

Hs.195177



Hs.159479
Hs.111497 
Hs.337460 



gb:yv)03b12.s1 Soares melanocyt

perilipin

CGI-69 protein 

transmembrane protease, serine
pleckstrin homology-like domai
CA125 antigen; mucin 16
hypothetical protein FLJ14996
hypothetical protein FLJ23476
hypothetical protein FLJ22969
ESTs, Weakly similar to ALU5_H
cytochrome P450, subfamily IIS
hook2 protein

pyridoxal (pyridoxine, vitamin
pim-2 oncogene
bicarbonate transporter relate
mitogen-activated protein kina
hypothetical protein FLJ22709
hypothetical protein FLJ20917
hypothetical protein, clone 24



prostaglandin E synthase 
Homo sapiens, Similar to hypot
calpain 4, small subunit (30K)
FK506-binding protein 2 (13kD)
laminin, alpha 5
annexin A11

inositol 1,3,4-triphosphate 5
phosphorylase kinase, gamma 2
glycogen synthase kinase 3 alp
serum amyloid A1
gb:zo71c07.s1 Stratagene pancr



galactosamine (N-acetyl)-6-sul 
similar to mouse neuronal prot 
ESTs, Weakly similar to A47582 
DKFZP585G1722 protein 



SS,TM,SS,TM,DDOST_48kD

perilipin,SS

mito_carr,SS,TM

SS,TM,ldl_recept_a,trypsi

PH,SS,LIM,Troponin

SS,TM



SS,TM,myosin_head,RA,DAG
SS,PWWP,Exonuclease,lipoc
SS,perilipin
Sec1,SS,TM
MAPEG,SS,TM,MAPEG
SS,SS,arf,ras,fn3,ras
efhand,SS,CAP_GLY
SS,FKBP,SS,PDGF,C2,PI-PLC
laminin_EGF,laminin_G,EGF
annexin,SS,annexin
SS,oxidored_nitro,SS
pkinase,SS,SNF2_N,helicas
pkinase,SS,Ets
SS,SAA_proteins,SS,SAA_pr
SS



HCO3_cotransp,SS



SS,TM,TGFb_propeptide,TGF

SS,Tropomyosin

WD40,SS

fer4,SS,TM,V_ATPase_sub_a
SS,rrm,SS

SS,TM,Y_phosphatase
419167


AI589535 


Hs.94875 


ESTs, Weakly similar to A35363




18.6 


406663 


U24683 




immunoglobulin heavy constant




18.5 


429712 


AW245825 


Hs^11914 


ENSP00000233627*:NADH-ul)iqiiino 


oxidored_q6,88,TM,rnn 


18.5 ■ 


42S848 


BE242709 


Hs.1 59637 


valyl-tRNA synthetase 2


GST_C,GST_N,Tfopomyosin,8 


18.4 


447151 


A1022813 


Hs.92679 


Homo sapiens clone CDABP0014 m


SS,TM,U?R,aminolran_1_2 


18.4 


413343 


BE392026 


Hs.334346 


hypothetical protein MGC13045




18.2 


450029 


AW073380 


Hs.267963 


hypothetical protein FLJ10535


SS,Pyridox_oxidase,zf-C2H 


17.7 


427721 


AI582843 


Hs.180455 


RAD23 (S. cerevisiae) homolog


ubiquitin,UBA,integrin_B,


17.6 


443780 


NM_012068


Hs.9754 


activating transcription facto
8D6 antigen 


bZIP,NTP_transf_2,SS,TBC 


17.2 


421612 


AF161254 


Hs.106196 


Idl recepLa,SS,TI«l 






AW405635 


Hs.293687 


ESTs 


SS,PI-PLC-X,PH,PI-PLC-Y,C 


16.7 


406621 


X57809 


Hs.181125 


immunoglobulin lambda locus 






443496 


AJ 006973 


Hs.9482 


target of mybl (chicken) homol 


VHS,GAT,TI«l,Heme_oxygenase 




440104 


AA132838 


Hs.239894


hypothetical protein MGC2803 


SS,DS 




427640 


AF058293 


Hs.180015


D-dopachrome tautomerase 


MIF.Iatej)rotein_li,68,GE 


16.2 


445625 


BE246743 




hypothetical protein FLJ22635 


SS,TM 


16.1 


427461 


AA531527 


Hs.332040 


hypothetical protein MGC13010


SS,TM,ACAT,LRR


15.9 


423366 


Z80345 


Hs.127610 


acyl-Coenzyme A dehydrogenase, 


Acyl-CoA.dhA;yl-CoA_dh_M 


15.7 


409017 




Hs.272299 


hypolheUoal protein F?P4-622U 


SS,TM 


15.5 


428167 


AA770021 


Hs.16332 


ESTs 


SS,ig,fn3


15.5 


420029 


BE258876 


Hs.94446 


polyamine-modulated factor 1


aldo_keLred,SS,TM,gla 


15.5 


400460 






C11002253':gi|129091|sp|P23267 


SS,TM,SCAN,zf-C2H2,KRAB


16.4 


407767 


W1539B 


HSJ8628 


hypothetical protein 


88if-CCCH 


15.4 


406918 


M88357 




gb:Homo sapiens DNA-binding pr


zf.C2H2,SS 




435158 


AW663317 


Hs.65588 


DAZ associated protein 1




15.3 


407619 


AIJ050341 


Hs.37165 


collagen, type IX, alpha 2 


SS,Collagen,SS,Collagen


15J 


421273 


AJ245416 


Hs.103106 


U6 snRNA-associated Sm-like pr


Sm.SS,timsynL1,GST_C,G 
88,88.TM.ig 


15.1 


402365 






Target Exon 


14.9 


450503 


R35917 


Hs.301338 


hypothetical protein FLJ12587


88 


14.8 


427602 


AI811866 


Hs.7133 


Homo sapiens, clone IMAGE:3161 


SS,TM,ABC_tran.Glyco_tran 


14.6 


432872 


AI908984 


Hs^79623 


selenoprotein X, 1 


DUF25,88,rebosoma!_L3.PDZ 


14.5 


439233 


AA831893 


Hs.292767 


hypothetical protein FLJ23109


zf-C3HC4.TM,8ulfate_trans 


14.5 


416897 


M78146 


Hs.324700 


hypothetical protein MGC2663 


SS 


14.3 


447304 


Z98883 


Hs.18079 


phosphatidylinositol glycan, c 


SS,Peptidase_C2


14.2 


431643 


AW969619 


Hs.259768 


adenylate cyclase 1 (brain) 


TM 


14.0 


447644 


AA401573 


Hs.288284 


hypothetical protein FLJ2237B 


88,TM 


14.0 


417595 






KIAA1598 protein


88,TM,Glyco_hydro_31,Glyc 


13.8 


436127 






RIKENcDi^A 2010100012 gene 


Corona_7,SS,TM 


13.8 


412623 


R28898 


Hs.74170 


metallothionein 1E (functional


SS,TM,metalthio,DEAD,meta


13.7 


448133 


AA723157 


Hs.73769 


folate receptor 1 (adult)




13.5 


453367 




Hs.7a673 


PKCI-1-related HIT protein




13.5 


431462 


AW583672 


Hs.256311 


granin-like neuroendocrine pep 


SS 


13.2 


408724 






ESTs, Weakly similar to T22914 


SS,pkinase,tubulin


13.2 


423464 


NM 015240 


Hs!l28856 


CSR1 protein 


Collagen,SS 


13.1 


428539 


AW410063 


Hs.184877 


solute carrier family 25 (mito


mito can',SS,TM,profilin, 


13.0 


436C14 




Hs.283741 


exosome component Rrp46 


RNase_PH,RNase_PH_C,SS,TG 


12.9 


438857 


AI627912 


Hs.130783 


Forssman synthetase 


SS,R^RasGEF,RasGEFN 


12.8 


444410 


BE387360 


Hs.33719 


ESTs, Moderately similar to 86


SS 


12.8 


427527 


AI809057 


Hs.153261 


immunoglobulin heavy constant 


SS,TM.ig 


12.6 


430168 


AW968343 




DKFZP434I1735 protein 


SS,TM.efhand,efhand 


12.5 


437543 




Hs.7117 




SS,TM,ligj;han,ANF_recept 


12.4 


413711 


AW291765 


Hs.75486 


heat shock transcription facto


NASS,E2F_TDP 


12.3 


422625 


AW504698 


Hs.155976 


cullin4B 


SS,SS,Cullin,Cullin 


12.2 


443136 


NM_001440


Hs.9018 


exostoses (multiple)-like 3 


Exostosin,SS,TM


12.1 


407143 


C14076 


Hs.332329 


EST 


SS,TM


12.1 


424707 


BE061914 


Hs.10844 


Homo sapiens cDNA FLJ14476 fis


SS,SS,TM,Sema 


12.1 


425261 


Z22521 


Hs.155342 


protein kinase C, delta 


pkinase,DAG_PE-bind,pkina 


12.0 


427336 


NM_005658 


Hs.2134 


TNF receptor-associated factor 


l*\TH,SE,MATH,A2M_N,A2M,MT 


12.0 


421572 


AA531607 




hypothetical protein FLJ22678 


SS,TM,TGF-betayi8C 


12.0 


447946 


AI566164 


Hs.1 65327 


ESTs 


SS,PTN MK,7tmJ,DAGK0,DAG 


11.9 


425954 


AK000633 


Hs.1 54476 


hypothetical protein FLJ20626


SCAN,zf-C2H2,KRAB,88,KlRAB 


11.7 


427273 




Hs.1 07376 


hypothetical protein DKFZp434N 


SS,SSTM 


11.7 


427397 


AI929685 


Hs.1 77656 


calmodulin 1 (phosphorylaseki 


efhand,RrnaAD,SS,efband 


11.7 


424415 


NM_001975 


Hs.1 46580 


enolase 2, (gamma, neuronal)


enolase, SS,Alrophin-1 Air 


11.7 


417852 


AJ250562 


Hs.82749 


transmembrane 4 superfamily me


transmembrane4,SS,TM


11.6 


447451 


AI379925 


Hs.207525 


ESTs 


SS,pkinase,PH,pkinase_C 


11.5 


410397 


AF217517 


Hs.63042 


DKFZp564J157 protein 


SS,homeobox,UPF0160,DUF23 




430354 




Hs.239784 


human homolog of Drosophila Sc 


SS,TM,ig 


11.3 


419390 


AI701162 


Hs.90207 


hypothetical protein MGC11138


SE,TM,PMP22_Claudin,PI^P22 




422682 


W05238 


Hs.94316 


ESTs, Weakly similar to T31613 


SS,TM,DEAD,helicase CLarn 






AL1 22083 


Hs.1 12645 


Homo sapiens mRNA; cDNA DKFZp4 


SS,TM 




450122 


BE313765 


Hs.343443 


ESTs, Weakly similar to 138022 


EE,TM,Y_phosphatase,LON,A 




453968 


AA347843 


Hs.62711 


High mobility group (nonhiston 


SS,HI»IG_box 




444744 


BE3g4732 


Hs.147562 


ESTs 


SS 


10.9 


423220 


BE394920 






WD40,TM,Activin_recp,pkln 




417116 


Z43916 


Hs7634 


hypothetical protein FU12287 


SS,TM.filament,IF_lall 


10^9 


406779 


AA412048 


HS279574 


CX3I-39 protein; cell death-reg 


SS,SS 


10.8 


450593 


AF129085 


Hsi5197 


8TIP1 homology and U-Boxconia 


TPR,88.TM,Rhombdd.lactam 


10.7 


406837 




Hs.156110 


immunoglobulin kappa constant 


SS 


10.7 


452434 


D30934 


Hs^S49 


C-type lectin-like receptor-1


lectin_c,SS,TM


10.7 


440150 


AW97573B 


Hs.7001 


Homo sapiens, clone IMAGE:3940 


S8.TM.88.TM,Peptidase_M22 


10.6 


418641 


BE243136 


Hs.86947 


a disintegrin and metalloprote


disintegrin,Reprolysin,Pe


' 10.6 


414313 


NKt004371 


Hs.75887 


coatomer protein complex, subu 


WD40,SS,WD40,Ribosomal_S2 


10.6 


420307 


AW502869 


Hs.66219 


ESTs 


ss.m 


10.6 
414918


A1219207 


Hs.72222 


hypothetical protein FLJ13459


SS,TM,efhand


10.6 


♦46562 


BE272686 


Hs.15356 


hypothetical protein FU202S4 


hormone,SS,p(kB 


10.5 


419846 


NMJ15977 


Hs.285681 


Williams-Beuren syndrome chrom


SS,Hm,S8,TM,WD40 


10.4 


453023 


AW02B733 


HsJ1439 




Kunilz BPri,SS,TIW,ion tra 


10.4 


438800 


AB037108 


Hs.6418 




SS,TM 


10.3 


431275 


T56571 


Hs.10041 


ESTs 




10.3 


407241 






gb:Human omega light chain pro 


Ss!lg,PH,lg,PH 


10.3 


441238 


AI372555 


Hs.322456 


tiypothetical protein DKFZp751D 


homeobox,SS,TI«l,Rho_GDI,th 


10.3 


436325 


AL390088 


Hs.7393 


hypothetical protein from EURO 


SS,Synapsin_C,SS 


10.3 


435605 


AF151815 


Hs.4973 


hypothelcal protein 


SS,TM,SS,TM,ABC_tran,ABC 


10.3 


444202 


AL031685 


Hs.12785 


KIAA0939 protein 


SS,Tlv1,Na_H_Exchanger,ABC2 


10.3 


425597 


U28694 


Hs.1 58324 


chemokine (C-C motif) receptor


7tm_1 


10.3 


415200 


AL040328 


Hs.78202 


SWI/SNF related, matrix associ 


SNF2_N,he!icase_C,bromodo 


10.2 


446528 


AU076640 


Hs.15243 


nucleolar protein 1 (120kD)


Noll Nop2_Sun,SS,SNF2_N,h 


10.2 




D26351 


Hs.77515 


inositol 1,4,5-triphosphate re


TI«,RYDR_ITPR,ion_trans,MI 


10.2 


423524 


AF055989 


Hs.129733 


potassium voltage-gated channe 


ion_trans,K tetra,thaumat 


10.2 


434552 


AA639618 


Hs.325116 


Homo sapiens, clone MGC:2962, 


SS 


10.2 


406836 


AW514501 


Hs.155110 


immunoglobulin kappa constant 


SS 


10.1 


420233 




Hs.1 94864 


hypothetical protein FU22578 




10.1 


427468 


BE208364 


Hs.29233 


ESTs, Weakly similar to LKHUp 


SS,F5 F8 type CEGFJGT 


10.1 


427672 


AA356615 


Hs.336916 


death-associated protein 6 


SS,BTB,abhydrolase_2,RasG 


10.0 


423218 


NM_015896 


H3.157380 


BLu protein 


zf-MYND,SS,Tlil,Glyco_hydro 




403026 






Target Exon 


SS,lrelbil 


10.0 


41279C 


NII4.014767 


Hs.74583 


'KIAA0275 gene product 


l«EaI,thyroglobuiin 1,zf- 


10.0 


419823 


AW271708 


Hs.1 18918 


-ESTs. Weakly similar to M20IVLH 


8S.TM 


10.C 


433886 


AA613596 


Hs.28412 


ESTs 


SS 


9.9 


428092 


AW87gi41 




ESTs 


SS.TM 


9.8 


450493 


M93718 


Hs.1 66373 


nitric oxide synthase 3 (endol 


flavodoxin,FADJinding,NO 


9.7 


420423 


AA827718 


Hs.88218 


ESTs 




9.7 


452302 


AF1 73867 


Hs.28g06 


glucocorticoid modulatory elem 


SAND,SS 


9.7 


444661 


AJ243937 


Hs.288316 


chromosome 6 open reading fram 


notch,EGF,ank,GoLoco,SS,T 


9.7 


414249 


A1797994 


Hs.279929 


gp2SL2 protein 


SS,TM,EMP24_GP26L,SS,TM,G 


9.6 


424263 


M77640 


Hs.1 757 


L1 cell adhesion molecule (hyd 


fh3,ig,iRK,SS,TM,fn3,ig,R 


9.6 


438627 




Hs.1 23473 


ESTs 


TM,Reticulon 


9.6 


407065 






gb:H.sapiens DATI gene, partia 


SIMF,SS,TM 


9.6 


441307 


AW071696 


Hs.209065 


hypothetical protein FU14225 


8S,TM 


9.6 


40964S 


AA1 59216 




hypothetical protein FLJ20442 


Y_phosphatase,DSPc,TI^ 


9.6 


424487 




Hs.6259 


KIAA16g8 protein 


SS,SS,TIVI,Glyoo_hydro_31,G 


9.6 


444633 


AF111713 


Hs.286218 


Junctional adhesion molecule 1 


ig,SS,TM,HLH 




427747 


AW411425 


Hs.1 80655 


serine/threonine kinase 12 


pkinase,SS,TM,synaptobrev 
SS 


9^4 


450437 


X13966 


Hs.24998 


hypothetical protein MGC10471 




415169 


W42913 


Hs.73089 


ATPase, vacuolar, 14 kD 


ATP-synLF,SS,TI^,CH,Filam 




400201 






NM.006156-:Homo sapiens neural 


ubiquitin,SS,TI^,Transgiut 


9^4 


454319 


AW247736 


Hs.101617 


ESTs, Weakly similar to T32527 


SS 


9.4 


421680 


AL031186 


Hs.239106 


Human DNA sequence from clone 


SS,SS,rTm,2f.RanBP,rTm,GA 


9.4 


445143 


U29171 


Hs.75352 




pkinas9,SS 




407507 


U73799 




gb:Human dynactin mRNA, partia


SS,TM,HC03_cotransp,CAP_G 


. 9.4 


450883 


NM.001348 


Hs.25519 




pklnase,GTP_EFrU,EFG_C,GT 






AW861123 




gb:RC3-CT0297-1202004)14-a05 C 


SS 


9.3 


414626 


AA335738 


Hs.76686 


glutathione peroxidase 1 


GSHPx,SS,ras,HLH 


9.3 


456950 


API 11 170 


Hs.306165 


Homo sapiens 14q32 Jagged2 gen


SS,TiW,DSL 


9.3 


446333 


BE537641 


HS.4427B 


hypothetical protein FU12S38 


SS 


9.2 


407204 


R41933 


Hs.140237 


ESTs, Weakly similar to ALU1_H 


SS,histone,histone 


9.1 


412338 


AA151527 


Hs.69485 


hypothetical protein FU12436 


SS,TM.TIG,Seraa,PSi 




439963 


AW247529 


Hs.6793 


platelet-activating factor ace


PAFnAH.Ib.Upase_GDSL,S8, 


9.1 


412104 


AW205197 


Hs.240951 


Homo sapiens, Similar to RIKEN


SS,TiM 


9.1 


443553 


AL040535 


Hs.9573 


ATP-binding cassette, sub-fami


ABCJran.SS 


9.1 


448984 


AW751955 


Hs.22753 


hypothetical protein FU22316 


SS 


9.0 


418776 


AI401004 


HS.BB411 


lymphocyte antigen 117 


SS,TNF,TNF 


9.0 


418843 


AJ261016 


Hs.89230 




TM,CaMBD,SK.channel.TM 


9.0 


419244 


AI436567 


Hs.89761 


ATP synthase, H transporting. 


ATP.«ynLDE,88,mii,Ephiin 




451855 


R54913 


Hs.1 75804 


ESTs 




8.9 


424825 


AF207069 


Hs.1 53357 


procollagen-lysine, 2-oxogluta • 


20G-FelI Oxy,Glycos trans 


8,9 


447374 


AF263462 


Hs.1 8376 




SS,Myosin_tai!,lvl 


8.9 








FEV protein 


EU.SE.crystail 


8.8 


409936 


AK001691 


Hs.57655 


hypothetical protein FU 10829 


ES,TM 


8.7 




BE383605 




small GTP-binding protein 


SS,TM,TPR 


8.7 


430037 


BE409649 


Hs.227789 


mitogen-activated protein kina 






424919 


BE314461 


Hs.153768 


U3 snoRNP-associated 55-kDa pr 


WD40,88,KH-domain 


8.7 


414534 


BE257293 


Hs.76366 


BCL2-antagonist of cell death


SS,hormone_rec,zf-C4


8.7 


433333 


Aid 6521 


Hs.71816 


v-akt murine thymoma viral onc


homeobox,pkinase,PH,pkina


8.7 


423228 


AL137491 


Hs.125511 


Homo sapiens mRNA; cDNA DKFZp4 


SS,TM,sushi 


8.7 


419493 


AF001212 


Hs.90744 


proteasome (prosome, macropain 


Pa,SS,CDK5_activator 


8.7 


420160 


AI492840 




ESTs 


SS.TIM 


ae 




AK001416 


HsJ06122 


glycoprotein, synaptic 2


TM,Steroid_dh,SS




447827 


U73727 


Hs.19718 


protein tyrosine phosphatase,


Y_phosphatase,fn3,ig,MAM,


8.6 


417193 


AI922189 


Hs.288390 


hypothetical protein FU22795 




B.6 


418678 


NI\L001327 


Hs.167379 


cancer/testis antigen (NY-ESO-


SS,TM,zf-C2H2 


6.5 


458963 


AI701393 


Hs.278728 


Rad and Gem-related 2 [rat ham 


ras,SS,Peptidase l^10,herao 


as 


406868 


AAS05445 


Hs.3006g7 


immunoglobulin heavy constant


SS,TM,ig 


as 


434105 


AW952124 


Hs.13094 


presenilins associated rhomboi


TM,Rhomboid,SS,TM


as 


421726 


AK001237 


Hs.319088 


hypothetical protein FU10375 
lectomedin-2 


as 


421707 


NM.014921 


Hs.107054 


Utrophllin,OLF,7tmJ,Ga! 


a2 


453898 


AW003S12 


Hs.232770 


arachidonate lipoxygenase 3 


8S,TiW,lipoxygenase,PLAT,s 


8.2 
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SS,PK,PK_C,myosin_tiead,Rh 



444341 AI142027 

413762 AW411479 • 

436685 W28661 

420932 AW374605 

431493 AI791493 

447598 AI799968 

415758 BE270465 

457022 AW377258 

426440 BE382756 

432747 NM_014404 

441084 W24563 

424443 A1751281 

424198 ABQ29010 

430513 AJ012008 

417900 BE250127 

432891 AF161483 

432234 AA531128 

453486 BE620712 

441327 AK001706 

436540 BE397032 

418256 AW845318 

457274 AW674193 

437141 BE304917 

425428 AL1 10261 

431934 AB031481 

418349 NM_001383 

430600 AW9509S7 

421758 BE397336 

412841 AI751157 

418313 BE244231 

429367 AB007867 

418837 U48263 

423015 U18548 

440188 AK001812 

421976 AW961017 

423858 AL1 37326 

446143 BE245342 

417704 NM 001747 

440869 NM_014297 

435099 AC004770 

438856 N40027 

426268 AF083420 

418373 AW760770 

445087 AW893449 

421748 NM.014718 

413837 AW163525 

426691 NI\La)6201 

409125 R17268 

424251 AA677466 

431630 Nl^002204 

428156 BE269388 

459255 AI493244 

441323 AA928413 

455928 BE170313 



442296 NM_007275 

445937 AI452943 

439732 AW629604 

429542 AF038660 

420190 AI816209 

408215 BE614290 

410277 R88621 

419657 AU077005 

448677 AI660769 

425228 NM_005253 

432538 BE258332 

421864 BE387198 

429962 M69113 

406867 AA157857 

426068 AF029778 

419344 U94905 



Hs.5288 
Hs.11607 
Hs.129873 
Hs.1 99630 
Hs.78793 

Hs.1 69902 
Hs.278907 
Hs.99y 

HsillMO 

Hs.82906 

Hs.279761 

Hs.1 15803 

Hs.33026 

Hs.7778 

Hs.14468 

Hs.12271 

Hs.227152 

HsJ1097 

Hs.1 57211 

Hs.272214 

Hs.84183 

Hs.274348 

Hs.1422 



Hs.7485 

Hs.4756 

Hs.7473 

Hs.158913 

Hs.84344 

Hs.1 2303 

Hs.107809 

Hs.171834 
HS343567 
Hs.143696 
Hsi65829 
HS.18269B 
Hs.239500 
Hs.15g089 

Hs.205736 

Hs.105606 

Hs.8186 

Hs.321231 

Hs.167641 

Hs.206713 

Hs.95867 

Hs,26249 
HS.9220B 

Hs.301612 
Hs.278362 
Hs.108973 
Hs.226795 
Hs.182265 
Hs.166164 
Hs.277445 
Hs.151706 

Hs!l5463 

Hs.133230 

HS.334841 

Hs.279869 

Hs.4241 

Hs.2942 

Hs.159161 



FK506*indingpn . . 
Homo sapiens mRNA; cDNA DKFZp4 
ESTs, Weakly similar to T21697
ESTs, novel cytochrome P450 
ESTs 

protein kinase C, zeta
gb:MR2-CT0222-261099-003-a10 C 
solute carrier family 2 (facil



Caklesman,SS,Ritiosomd_S2 

88,™,Reprolysin,Pep_M12B 

FICBP.TPR,SS 

SS,TIVI,pl<inase>^clivin_rec 

S8,bZIP_Maf 



hypothetical protein FLJ11773
hypothetical protein from EURO
KIAA1087 protein
G6C protein

CDC20 (cell division cycle 20,

HSPC134 protein

ESTs

hypothetical protein PP2447
hypothetical protein FLJ10751
hypothetical protein MGC14226
f-box and leucine-rich repeat



diptheria toxin resistance pro
HLA-B associated transcript-3
Gardner-Rasheed feline sarcoma
hypothetical protein MGC11352
CGI-05 protein

prepronociceptin
G protein-coupled receptor 12
N-Acetylglucosamine kinase
hypothetical protein FLJ11856
Homo sapiens mRNA; cDNA DKFZp4



ESTs 

serine/threonine kinase 24 (St 
CGI-135 protein 
suppressor of Ty (S.cerevisiae 
KIAA0726 gene product 
til'n^;ap(i9lethonln) 
PCTAIRE protein kinase 1 
axonal transport of synaptic v 
coactivator-associated arginin
integrin, alpha 3 (antigen CD4
mitochondrial ribosomal protei
hypothetical protein MGC13114
ESTs, Weakly similar to ALU7_H
gb:QV4-HT0536-040500-19ig02 H 
HLA class II region expressed 
hypothetical protein FLJ20512
lung cancer candidate
UDP-Gal:betaGlcNAc beta 1,4- g
hypothetical protein from EURO
UDP-Gal:betaGlcNAc beta 1,4- g
hypothetical protein EST00098 

ESTs, Weakly similar to T2D3_H 
a disintegrin and metalloprote



ESTs 
FOS-like antigen 2 



dolichyl-phosphate mannosyltra 
glutathione S-transferase pi 
keratin 19 
jagged 2 

diacylglycerol kinase, zeta (1 
KIAA0134 gene product 
lymphotoxin beta receptor (TNF 
Homo sapiens, clone IMAGE:2959



SS,TM

pkinase,DAG_PE-bind,pkina

SS,Ribosomal_L7Ae

sugar_tr,SS,TM,sugar_tr

PMP22_Claudin,SS,TM,PMP22

SS,TM,hormone_rec,zf-C4

SS,TM,SS,TM

SS,TM,Na_Ca_Ex,Calx-beta,
SS,TM,GST_C,abhydrolase
WD40,SS,TM,fn3,EGF,fn3,ig
SS,TM,ubiquitin,Transglut

SS,TM 

SS,TM,7tm_1 

SS,TM 

SS,SS,TM,HSF_DNA-bind 
SS,THSS,TM,aalhrinJg_c 
SS,TM,Glycos_transf_4



Rho GDP dissociation inhibitor
Diphthamide_syn,SS

ubiquitin,SS,TM,G-patch,a

SH2,SH3,pkinase



SS,TM 

secY,SS,TM

Gelsolin,SS,Gelsolin

lactamase_B,SS,XRCC1_N,BR

XPG_N,XPG_I,5_3_exonuclea

SS,TM,connexin 

pkinase,pkinase 

SS,TM,PMP22_Claudin,2OG-F

S1,SH2,Ribosomal_L23,pkin

cadherin,TM,TPR

SS,Methyltransf_3

pkinase,SS,UCH-2,UCH-1,rr

SS,kinesin,PH,FHA,kinesin

SS,SNF2_N,helicase_C,brom

integrin_A,FG-GAP,Rhabd_g



SS,Peptldase_CUI-C2H2 
SS 

kazal,SS,TM,ig,pkinase 
TM 

SS,TM,Glyco_hydro_56,Glyc 

Galactosyl_T_2,SS,TM,tsp_

SS,TM,SS,TM,A2M,A2M_N,NTR

Galactosyl_T_2,ig,SS,TM,A

SS,dynamin_2,dynamin,PH,G 

SS,SS,TM,HLHTRM,zf-CCCH 

ESTU,SS 

disintegrin,Reprolysin,Pe
bZIP,SS

SS,TliJ,AAA,RibosomaLI^ 

ESJM,SS,TM 

GST_C,GST_N,SS,efhand 

filament,bZIP,SS,filament

DSL,EGF,vwc,granulin,SS,T

ank,DAGKa,DAGKc,DAG_PE-bi

helicase_C,PRK,SS,TM,7tm_

TNFR_c6,SS

SS,efhand

TM,SS,TM,TPR,iBS 

SS,RFX_DNA_binding

SS,TM,RGS,DIX

fiJston_gly,homeobox,TM 

pkinase,Sema,PSI,TIG,A4_E

Rho_GDI,homeobox,SS,Cytid
440088
447665 
431785 



458367 
415010 
410076 
412940 
440042 
414023 
414513 



434202 
432183 
444416 
447206 
407704 
453190 
439975 
449514 
432805 
414362 
417483 



452094 
451524 
427438 
439685 
440511 
417334 
425976 
433173 



450778 
423612 
422701 
412958 
436957 



451920 
453054 
415117 
413163 
425246 
433271 
448484 
449139 
449181 
414457 
424964 
415193 
407754 
413049 



AW771318 

AW161481 

X14850 

AW264102 

BE382411 

AW151952 

AW238085 

BE617015



U81375 
NM_002067
NM_014699
BE391579 



D79991 

AA224483 

AI878903 



AL048891
AA527348
NM_002151



Hs.191208 
Hs.120243 
Hs.183232 
Hs.30011 
Hs^68763 
Hs.119387
Hs^84207 
Hs.76394 



Hs.819 
Hs.133898

Hs.76297 

Hs!56874 

Hs!l8747 



Hs.178011 

Hs.293937

Hs.7236 

Hs.157240 

Hs.334514 

Hs.3196 

Hs.6311 

Hs.51273 

Hs.293629 

Hs.9100 

Hs^587g 

Hs.25450 

HS.1S86 

Hs.119273

Hs.75087 

Hs.122952 

Hs.193313 

Hs.77313 

Hs.100221 

Hs.158196 



Hs.30002 
Hs.27239 
Hs.31547 
Hs.78016 
Hs.75216 



Hs.823 

Hs.63236 

Hs.343661 



hypothetical protein FLJ22638
hypothetical protein MGC2963
breakpoint cluster region prot
KIAA0792 gene product
Homo sapiens, Similar to RIKEN
enoyl Coenzyme A hydratase, sh



homeo box B7

gb:TCBAP1 D1053 Pediatric pre^ 

G protein-coupled receptor kin 

heat shock 27kD protein family

ubiquitin-like 4

POP7 (processing of precursor,
hypothetical protein MGC11134
integral membrane protein 3
H2A histone family, member X
ESTs 

guanylate kinase 1
hypothetical protein FLJ20739
hypothetical protein
ESTs, Moderately similar to T1
malignant cell expression-enha
KIAA0356 gene product 




gb:EST386752MAGErEi 
activated p21cdc42Hs kinase
hypothetical protein FLJ20041
centrosomal protein 2
hypothetical protein FLJ10604
hypothetical protein FU202S7 
ESTs, Weakly similar to FXDi.H 
eNOS interacting protein 
hypothetical protein MGC4737 
NG22 protein 
surfeit 1 

hypotheticai protein FU208ra 
hypothetical protein MGC2650
hypothetical protein MGC3121
Homo sapiens cDNA FLJ13100 fis
Homo sapiens mRNA; cDNA DKFZp4
solute carrier family 29 (nucl
guanine nucleotide binding pro
KIAA0296 gene product 



transcriptional adaptor 3 (ADA 



Target CAT 

polynucleotide kinase 3'-phosp 
protein tyrosine phosphatase, 
serum response factor (c-fos s 
nucleolar protein family A, me 
Homo sapiens, Similar to RIKEN 
phenylalanine-tRNA synthetase-



Homo sapiens cDNA FLJ14105 fis
hepsin (transmembrane protease
ribosomal protein S15a



ECH,Peptidase_U7,SS,TM



pkinase,RGS,pkinase_C,SS,
PX,SS

HSP20,SS,TM,zf-C2H2,BTB,E
ubiquitin,SS,TM,G6PD,G6PD
SS,TM,WD40,vwd,MAM,EPO TP 



histone,CBFD_NFYB_HMF,SS,

SS,TM,LRRCT,LRR

Guanylate_kin,CoaE,Viral_

zf-C3HC4,SpoA,PHD,TM,syna

SS,TM,LRRCT,Sema 

TM,MBOAT,SS,TM

PH,PHD,RUN,SS

Haiii1pJke,SS 

SS,PX,arf,lipocalin,PHD,z

SS,TM,7tm_2,GPS,EGF,SS,TM

NSF,SS,TM

Acyl-CoA_dh,Acyl-CoA_dh_M



pkinase,SH3
SS,TM,ion_trans
bZIP,5_3_exonucle3se,M,8S 
SS,SS,TM,pkinase,pkinase_
SS,TM 

SS,PWWP,TSC22

SS,TM,MAGE,Ribosomal_S17,

SS,TM,ion_trans

SS,TM,pkinase,SH2,SH3,BNR

SURF1,SS,TM,SURF1,SURF4 

TM,SET 

SS,ART,TM

SS

SS,TM,LIM,homeobox

SS/£C_tian,88.TM 

Nucleoside_tran,SS,TM,HSP

G-alpha,arf,SS,G-alpha

zf-C2H2,G8T_C,PHD.8S,T^H 

SS,pkinase

SS,DAGKc,DAGKa,RA,DAG_PE- 
MoaA_NIS_PqqE,SS,TM 

hormone_rec,zf-C4,SS,DNA_ 

homeobox,SS,homeobox,home 
Herpes_HEPA,SS 



Viral_helicase1,SS,Amino_
fn3,ig,Y_phosphatase,SS,T
SRF-TF,flavodoxin,SS,TM,p
SS,TM 

TM,SS,TM,Kunitz_BPTI

neur,SS,zf-C2H2,DNase_II

C2,SS,TM,Y_phosphatase,Tr 

ATF^ynLC,SS.TM,pkinase 

actin,SS 

SS,TM,aminotran_1_2,LRR

SS,TM,SS,TM,TSPN,tsp_3,SE

trypsin,SS,TM,ATP1G1_PLM_



KIAA1271 protein
gb2w2af11,s1 Soares ovary turn
416322 BE019494

406673 M34996 

415361 U44755 

411030 BE387193 

410653 BE383768 

433012 NM_004045 

437741 BE561610 

421883 X55079 

427361 AW732480 

411574 BE242842 

457313 AF047002 

428345 AI242431 

434845 BE267057 

427162 AB011133 

447402 H54520 

433676 AW371389 

424373 AJ133796 

423402 BE167615 

409983 D50922 

450184 W31096 

431629 AU077025 

430413 AW842182

440333 AI378424 

424927 AW973666 

412276 BE262621 

416181 AA174126 

440609 AI287585 

435327 BE301871 

421139 AW953933 

453449 W16752 

414411 X54079

440906 AW161556 

421899 AJ011895 

439473 AI215529 

461686 AK001171 

407191 AA608751 

427515 T79526 
405325 

434119 AF193844

413052 BE249841 

445109 AF039916 

409323 H28855 

438707 L08239 

442599 AF078037 

420372 AW960049 

436576 AI458213 

439012 BE383814 

418910 Z25821 

414849 AW372721 

425743 BE396495 

418231 AA326895 

419238 AW959538

441917 AI989925

437617 AI025701

412867 AU076861

419579 W49529 

425824 AI939563 

439414 NM_001183 

436042 AF284422 

410775 AB014460 

453350 AI917771 

400300 X03363 

426811 BE259228 

421179 U72664 

429762 AI346255 

419250 AW770185 

426831 BE295216 

442103 AA333367 

414320 AA371931 

426347 AA454912 

423880 BE278111 

429545 AI824164 

443044 N23522 

417080 BE392846 

441455 AJ271671 

410182 NM_001983

456062 AI866286

439270 BE268278

408985 BE267317

416978 BE243985 

436057 AJ004832 



HSJ85813 
Hs.134729 
Hs.79217 
Hs.198253 
Hs.78403 
Hs.67896 
Hs.65238 
Hs.279910 



Hs.118282

Hs.325321 

Hs.173864 

Hs.18490

Hs.250173 

Hs.146219 

Hs.141556 

Hs.57729 

Hs.237617 

Hs.265827 

Hs.241392 

Hs.288761 

Hs.153850 

Hs.73798 

HsJ32163 

Hs.7301 

Hs.4867 

Hs.301372 

Hs.32981 

Hs.75067 

Hs.240170 



Hs.324051 

Hs.293660 

Hs.77542 

Hs.6455 

Hs.89456 

Hs.291623 

Hs.159428 

Hs.83848 

Hs.321214 



Hs.119178 
Hs.66196 
Hs.61790 



Hs.172673 

Hs.8088 

Hs.77422 

Hs.169407 

Hs.134200 



Hs.1063 
Hs.7854 
HS.59S44 
Hs.71962 



hypothetical protein FLJ11807
FXYD domain-containing ion tra
pyrroline-5-carboxylate reduct
major histocompatibility compl
small nuclear RNA activating c
7-60 protein

95 kDa retinoblastoma protein 
ATX1 (antioxidant protein 1,y 
putative transmembrane protein 
glucosidase, alpha; acid (Pomp 
cellular retinoic acid-binding
protein tyrosine kinase 9-like
transcriptional coactivator
PAP-1 binding protein
hypothetical protein R32184_1
KIAA0561 protein
hypothetical protein FLJ20452
hypothetical protein FLJ13158

Homo sapiens cDNA FLJ12976 fis
Kelch-like ECH-associated prot
Homo sapiens, clone IMAGE:3447
interferon, alpha-inducible pr
small inducible cytokine A5 (R
hypothetical protein FLJ21749
hypothetical protein C321D2.4 



SS,TM,pkinase

SS,TM

SS,TM,ATP1G1_PLM_MAT8,ATP

P5CR,Octopine_DH_N,SS,thi

SS,TM,MHC_II_alpha,ig,SS,



heat shock 27kD protein 1 

ed factor 1 
ESTs 

hypothetical protein MGC4549 
gb:ae56h07.s1 Stratagene lung
integral type I protein
C14000766':gi|702351'l.
COP9 complex subunit 7a
gb:600942857F2 NIH_MGC_15 Homo
ectonucleoside triphosphate di
Homo sapiens mRNA; cDNA DKFZp7
amino acid system N transporte
RelA-associated inhibitor
Homo sapiens, clone IMAGE:3535
ESTs 

RuvB(Ecoli homolog)-like 2 
Homo sapiens, Similar to dadec 
ESTs, Weakly similar to unname 



hypothetical protein AF053356_ 
ESTs, Moderately similar to RE
ATPase, H+ transporting, lysoso
cation-chloride cotransporter-
nth (E.coli endonuclease III)-
hypothetical protein FLJ23338
HER2 receptor tyrosine kinase 



U5i 

S-adenosylhomocysteine hydrola 
similar to S. cerevisiae Sec5p 
proteolipid protein 2 (colonic 
SAC2 (suppressor of actin muta
DKFZP564C186 protein
lymphocyte antigen 6 complex,
quinolinate phosphoribosyltran



SS,Collagen,Collagen

zf-C3HC4,SS,SNF2_N,helica

HMA,SS,TM 

SS,TM,SS,TM,RA,VPS9,SH2 

trefoil,Glyco_hydro_31,SS 

SS,TM,aminotran_1_2,LRR

cofilin_ADF,SS,TM

SS,rrm,SS,Cytidylyltransf

SS,TM

SS,TM,CH,calponin,ARID

SS,pkinase,PDZ,SS,SH2,Rho

SS,TM 

SS 

C2,SS 

BTB,Kelch,SS,TM

pkinase,SH2,SH3 
IL8,SS 

SS,TM,IP_trans,pkinase,pk
SS,TM 

MIF,SS,TM,MIF,sugar_tr 

SS,TM,GalP_UDP_transf,Gal

SS,Acyl-CoA_dh,Acyl-CoA_d

SS,HLH,Myc_N_term,Myc-LZ,

SS,TM

SS,Sema,ig,PSI,SS,TM,G-al



SS,Metallophos
SS,Peptidase_C1
EMP24_GP25L,SS 



TM,SS,TM 

SS,TM,GDA1_CD39,SS,TM,pho
TPR,SS,TM,pkinase,ig 
SS,TM,ACAT,MBOAT,SS,TM,TB 
SH3,ank,SS,TM,HHH,ig 

SS,TM,7tmJ,DnaJ 
AAA,DnaB,UPF0079,SS,Cys_k
ECH,SS,TM,aminotran_3,ABC
TM,pkinase 

Bcl-2,SS,feniHn,Bcl-2,e 
TM,SS,TM,zf-UBP,UCH-2,UB
SS,TM,WH2



SS,zf-C3HC4,Peptidase_M16
UPF0005,SS,TM

H^SP_domaln,SS,TM,CUB,fJTR, . 
SS,PHD

SS,SS,TM,GDI,Sema,TIG,PSI
SS,TM,aa_permeases,SS,TM,
HhH-GPD,SS,TM,REJ,PLAT,PK
SS,SS,TM,EMP70,PA28_alpha 

efhand,SS,TM,GFO_IDH_MocA
LIM,SS,TM,PMP22_Claudin,P
SS,zf-C3HC4,Palm_thioest
SS,TM,7tm_1,BAH,zf-CXXC,D 
AdoHcyase,SS 



QRPTase,QRPTase_N,SS,TM



zinc/iron regulated transporte
excision repair cross-compleme
ESTs, Weakly similar to B36298
hypothetical protein MGC2592
■jinMGC13010 



neuropathy target esterase 



SS,TM,HCO3_cotransp
SS,TM,ACAT,LRR
Vault,SS,TM,kinesin,zf-C2
cNMP_binding,SS,TM,cNMP_b






424601 


AI470163 


Hs.323342 


actin related protein 2/3 comp


SS,HhH-GPD 


4.8 


409214 


AW405967 


Hs.333388 


Homo sapiens, done IMAGEdSS? 

ESTs 


SS,EF1BD,P5CR 


4.8 


432716 


AI762964 


Hs.206180 


SS,TM


4.8


414460 


L00727 


Hs.898 


dystrophia myotonica-protein k


pkinase,SS,WD40 


4.8 


443329 


BE262943 


Hs.9234 


hypothetical protein MGC1936


SS,TM,SS,TM,gpdh,gpdh_C


4.7 


426120 


AA32S243 


Hs.166887 




C2,SS,aminotran_5




405356 






ENSP00000247029*:SEBOX. 




4.7 


437118 


AB037857 


HSJ00591 


CD9 partner 1


TM,ig,SS,TM 


4.7 


430609 


AA302921 


Hs.247362 


SS,TM,GST_C,abhydrolase


4.7 


447131 


NM_004585


Hs.17466


retinoic acid receptor respond


SS,TM,pkinase 


4.7 


428469 


BE549205 


Hs.184488 


Band_7,Flotillin,TM


4.7 


405189 






Target Exon 


SS 


4.7 


404256 






NM_024018*:Homo sapiens butyro


SS,TM,SPRY,SPRY,ig


4.7 


467966 


AI208986 




ESTs 


SS,zf-B_box,SPRY,SS,Nol1_


4.7 


413201 


BE275378 


Hs.13972 


hypothetical protein MGC12972


SS,SH2,RhoGAP,SH3,GILT


4.7 




AB015427 




zinc finger protein 219 


zf-C2H2,SS 


4.7 


442414 


BE408768 


Hs.8297 


ribonuclease 6 precursor


ribonuclease_T2,SS,ribonu


4.7 


418289 


AW403103 


Hs.83951 


Hermansky-Pudlak syndrome




4.6 


436730 


AA045767 




bladder cancer associated prot






444596 


BE560662 


Hs.11417 


Rab acceptor 1 (prenylated)


SS,TM,lig_chan,ANF_recept


4.6 




AI208513 


Hs.279915 


translocase of inner mitochond


zf-Tim10_PDP,SS 


4.6 




AI625869 


Hs.258609 


protein tyrosine phosphatase, 


fn3,Y_phosphatase,SS,TM


4.6 


400846 






sortilin-related receptor, L(D


ldl_recept_a,fn3,ldl_rece


4.6 




T79045 


Hs.163812




SS 


4.6 


420321 


D78761 


Hs.95557 


hypothetical protein


SS,Up_1,SS 




439921 


AL110209


Hs.5770


LCAT-like lysophospholipase


SS,LACT,SS,TM,aa_permease


4.6 


427122 


AW057736 


Hs.323910 


HER2 receptor tyrosine kinase


pkinase,Furin-like,Recep


4.6 


426899 


AL043221 


Hs.172825


KIAA1037 protein


WD40,TPR,SS,TM


4.6 


408116 


AA251393 


Hs.269052


Homo sapiens, Similar to RIKEN


SS,TM 


4.6 


412974 


R18978 


Hs.75105 


emopamil-binding protein (ster


SS,TM,SS,TM,TBC,rrm,FtsJ


4.6 


426510 


AW861225


Hs.110613 


BANP homolog, SMAR1 homolog


TM


4.6


414702 


L22005 


Hs.76932 


cell division cycle 34


UQ_con,SS,trypsin,ig


4.6 


408136 


AA317248 


Hs.42957 


methyltransferase-like 1


Methyltransf_4,SS,p450,Ge 


4.6 


445637 


W53459 


Hs.3949 


hypothetical protein MGC4172




4.6


452190 


H26735 


Hs.91668 


Homo sapiens clone PP1498 unkn




4.6


409680 


W31092 


Hs.55347 


mitochondrial ribosomal protei


SS,TM,synaptobrevin


4.6


421140 


AA298741 


Hs.102135


signal sequence receptor, delt 


Herpes_UL3,SS,TM,Sema,pki 


4.6




AI356293 


Hs.75339 


inositol polyphosphate phospha


SH2,SAM,SS,Folate_rec




402463 






NM_014624:Homo sapiens S100 ca


efhand,S_100,SS,efhand,S_


4.5 


426402 


BE387327 


Hs.30475 


polymerase (RNA) II (DNA direc


SS,PGAM


4.5 


406939 


M34515 




gb:Human omega light chain pro 


SS,ig,PH


4.5 


417891 


W79410 


Hs.32887 


protein phosphatase 1, regulat 


SS,THIS 


4.5


426207 


BE390657 


Hs.30026 


HSPC182 protein 




4.5


423664 


NM_004714 


Hs.130988 


dual-specificity tyrosine-(Y)- 


pkinase,SS,Fibrillarin,CK


4.5 


432562 


BE531048 


Hs.273422 


DKFZP585G1122 protein 


zf-C2H2,SS,TM,FG-GAP,inte


4.5 




W60675 




hypothetical protein FLJ10350




4.5 


432893 


NM_016164 




Homo sapiens clone PP1596 unkn 


ras,arf,SS,2OG-FeII_Oxy,2




424954 


NM_000546


Hs.1 845 


tumor protein p53 (Li-Fraumeni


P53,SS 


4.5 




AL046341 




discoidin domain receptor fami


F5_F8_type_C,pkinase,SS,T 


4.5 


448963 


AA459796 


Hs.331247 


Homo sapiens, clone IMAGE:3610




4.6 






Hs.79172 


solute carrier family 25 (mito 




4.5 


421962 


D82051 


Hs.288354 


FabG (beta-ketoacyl-[acyl-carr
trophinin associated protein (


SS,adh_short,SS,TM,zf-C3H


4.5 


426726 


AA488915 


Hs.171955 


SS


4.6 


414427 


L19711 


Hs.76111 


dystroglycan 1 (dystrophin-ass


SS,TM 


4.5 


435891 


AW24ffl94 


Hs.5002 


copper chaperone for superoxid


sodcu,HMA,SS,TM,spectrin,


4.5 


453997 


AW247615 


HsJ7003 


v-Ha-ras Harvey rat sarcoma vi




4.5 


449029 


N28989 


Hs.22891


solute carrier family 7 (catio


aa_permeases,SS,TM,bZIP 


4.5 


432078 


BE314877 


Hs.24553 


hypothetical protein FLJ12541


SS,TM 


4.5 


409650 


T08490 


Hs.288969 


HSCARG protein 


SS,SS,WD40 


4.5 


412833 


AW960547 


Hs.298262 


ribosomal protein S19 


SS,TM,ig,ITAM,Ribosomal_S




424133 


AA335721 


Hsi13628 


ESTs 


SS,TM


4.4 


414787 


AL049332 


Hs.77311 


BTG family, member 3 


SS,Anti_proliferat




433046 


AA229663 


Hs.279945 






4.4 


417068 


AA451910 


Hs.85852


hypothetical protein MGC3169


SS,TM


4.4 




D14697 


Hs.77393 


farnesyl diphosphate synthase


polyprenyl_synt,SS,TM


4.4 


418267 


BE339537 


Hs.83919 




Glyco_hydro_63,SS,PH 


4.4 


439902 


AF174499


Hs.6764


histone deacetylase 6


Hist_deacetyl,zf-UBP,SS,G


4.4 


443347 


AI587180 


Hs.110906


Homo sapiens, Similar to RIKEN


TM,SS




452160 


BE373541 




cysteine sulfinic acid decarbo


SS 






X92762 


Hs.7g021 


tafazzin (cardiomyopathy, dila


Acyltransferase,SS,TM,GDi 


4.4 


449717 


AB040935 


Hs.23954 


cerebral cell adhesion molecul 




4.4 


425069 




Hs.298184 


potassium voltage-gated channe


SS,aldo_ket_red




413380 


AI904232 


Hs.75323 




Band_7,SS,Band_7,SH3 




452911 


AA541537 


Hs.112619 


metallothionein 1E (functional


SS,SS,TM,Sec1 


4.4 




BE265254 




proliferation-associated 2G4, 






429213 


AA225065 


Hs]l98269 


Target CAT 


SS,Nop


4.4


447937 


BE621544 


Hs.157160 


hypothetical protein MGC2616


SS,I^DK,tJ^RIfr,tJ?RCT,LRR 




407230 


AA157857 


Hs.182265 


keratin 19 


filament,bZIP,SS,filament 


4.3


448886 


AL137291 


Hs.22451 


hypothetical protein FLJ10357


SS,PH,RhoGEF,SS,RnaseA


4.3


421178 


BE267994 


Hs.102419 


zinc finger protein 


zf-C2H2,SS,TM


4.3 


454031 


R36772 


Hs.71941 


hypothetical protein MGC15677 


TM 


4.3


450126 


BE018138 


Hs.24447 


Sigma receptor (SR31747 bindin 


SS,ig,fn3


4.3


416557 


U68566 


Hs.15318 


HS1 binding protein 


SS,TM,HflP,UBA 




413781 


J05272 


Hs.850


IMP (inosine monophosphate) de 


IMPDH_C,IMPDH_ig,CBS,NPD,S 


4.3






407161 
426613 
408616 
446616 
414467 
456867 
401751 
400563 



457757 
446388 
412825 
439737 
422256 
441164 
401727 
411142 
458176 
432178 
421537 
421380 
422702 
434142 
423696 
427407 



426551 
417782 

443639 



430067 
403449 
448099 



X75208 
Y09763 
AI124572 



NM_005397
H25836 
U96132 
R51604 



BE281303 
AA457456 
AF034373 
Z83844 



AF037332 

AA434109 

AA292979 

AW167439 

AI751438 

M64673 

AB023180 



D31833 

AJ011373 

U47927 

Z92546 

BE268649 

AI929320 

BE274009 

D60647 

AF103801 

AW410601 

BE302472 

NM_006245

AAS700S6 

AAS35762 

AA381268 

T10149 

BE269042 

AF207989 

Z21093 

AI751601 



M88359 

AW074836 

AW411280 

D30612 

U79458 

NM_004408 



NM_002975 
NM_003926 
AA341497 



Hs.322735 

Hs.98614 

Hs.2913 

HS.2Z785 

Hs.323879 

Hs.279861 

Hs.194691 

Hs.348183 

Hs.16426 
Hs.301527 
Hs.171280 
Hs.300842 
Hs.334873 
Hs.85752 



Hs.236522 
Hs.299148 



Hs.18528 
Hs.61258 
Hs.278569 
Hs.12271



Hs.140309
Hs.272814 
Hs.105547 
Hs.1372



Hs.75516 

Hs.772 

HS.2S0879 

Hs.16361 

HsJ0026 

Hs.11314 

Hs.118244 

Hs.122730 

Hs.183435 

HsJ23947 

Hs.4243 

Hs.9661 

Hs.58014 

Hs.30352 

Hs.3375 

Hs.247280 



Hs.74375 

Hs.173984 

Hs.90693 

Hs.58167 

Hs.231840 

Hs.166161 

Hs.61976 

Hs^4S 

Hs.14g846 

Hs£9621 

Hs.129926 

Hs^ 

HS.10S927 

Hs.178728 

HsJ1408 

Hs.61790 

Hs.6351 

H3.16S263 

Hs.4884 



gamma-aminobutyric acid (GABA)

inhibitor of kappa light polyp

CGI-31 protein

retinoic acid induced 3

tumor necrosis factor receptor 

C12000644:gi|5729785|ref|NP_00 

podocalyxin-like

ESTs, Moderately similar to un

hydroxyacyl-Coenzyme A dehydro

KIAA1608 protein

ESTs, Weakly similar to ALU8_H 

gb:yc18da3.s1 Stratagene lung 

RAN binding protein 3 

Target Exon 

DKFZP434P106 protein 

C11000273*:gi|12656107|gb|AAK0

hypothetical protein FLJ21801

hypothetical protein FLJ20435

hypothetical protein dJ37E16.5
ENSP00000243264:Dolichyl-dipho
gb:Homo sapiens skin-specific
Sjogren's syndrome nuclear aut



sorting nexin 17 

F-box and leucine-rich repeat

NPD007 protein

Homo sapiens cDNA FLJ13625 fis
Homo sapiens mRNA full length
heat shock transcription facto 
KIAA0g63 protein 
Target Exon 



Homo sapiens, clone IMAGE:3677
hypothetical protein DKFZp434E
neural proliferation, differen
arginine vasopressin receptor
chromosome 9 open reading fram
ubiquitin specific protease 6
Sushi domain (SCR repeat) cont
■~" (NAO;p 



ESTs, Highly similar to CTXNR

HSPC182 protein 
DKFZP564N1363 protein 
protein phosphatase 2, regulat 
ESTs, Moderately similar to Kl 
NM_004646:Homo sapiens NADH de 
ESTs 

hypothetical protein FLJ12650
proteasome (prosome, macropain
Homo sapiens, Similar to G pro
ribosomal protein S6 kinase, 5
TNF receptor-associated factor
HBV associated factor 

Target Exon 

hypothetical protein MGC1S42g 
gb:Homo sapiens DNA-binding pr
dishevelled 1 (homologous to D

T-box 1

replication initiation region 
zinc finger protein 282 
WW domain binding protein 2 

Homo sapiens cDNA FLJ12947 fis
hypothetical protein FLJ20643
integrin, beta 5

ESTs, Weakly similar to A40815
ESTs 



SS,SS,zf-C3HC4,SPRY
bZIP,SS

EPH_lbd,fn3,pkinase,SAM

Neur_chan_LBD,Neur_cha 

zf-C2H2,SS 

thiored,SS,TM

7tm_3,SS,TM 

TNFR_c6,SS 

SS,TM,SS,TM

SS,TNF

adh_short,SS

SS,DENN,DENN

SS,Zn_carbOpept

SS,TM,isodh

SS,Orexin,SH2,STAT

SS,Pep_M12B_propep 

abhydrolase,TM 

SS,TM,7tm_1 



SS,Hydrolase,SS,Gal-bind_ 



PX,fn3,pkinase,SAM,EPH_lb

SS,F-box,SS,TM,HEF_DNA-bi

SS,TM

SS

SS,C1q,Collagen
N^SS,TM,F-box 
helicase_C,SS,RNA_pol_H
A_deamin,SS

Galactosyl_T,SS,Ribosomal 



7tm_1 



SS,TM



l-2,UBA,UCH-1,SS 
SS,TPR,vwd,sushi,Somatome
BRCT,PARP,zf-PARP,PARP_re 
pkinase,SS,TM,ig 
Glycos_transf_1,SS 
SS,TM,mn 
DAO,SS



stem cell growth factor; lymph 
methyl-CpG binding domain prot
RAR (RAS like GTPASE)
hypothetical protein FLJ23338



B56,SS,TM,Atrophin-1,Exo_
TM,SS

SS,TM,Josephin,UIM,Joseph
SS,TM

proteasome,SS,TM,LACT,try
SS,TM,7tm_3,SS,TM

zf-C3HC4,MATH,zf-TRAF,SS, 

SS,zf-C3HC4,zf-RanBP,pkin 

UQ_con,SS,TM 

SS,TM,ig,DSPc 

SS,KH-domain,WD40,Ribosom

SS,PDZ,DEP,DIX,Dishevelle

SS,TM,T-box,GTP_CDC,LRRCT 

zf-C2H2,LIM,TM 

zf-C2H2,KRAB,SS,zf-C2H2,K 

GRAM,SS 

PH,GED,dynamin,dynamin_2,

SS,TM,sugar_tr,PID
integrin_B,EGF,SS,TM
SS,TM,ig,pkinase 
SS,TM 

ThiF,UBACT,SS,pkinase,UCH 
lectin_c,SS,TM
SS,HLH 



pkinase,SS,hexokinase,hex






414168 


AW793296 


Hs.103845 


ESTs, Moderately similar to IS 




3.9 


451982 


F13036 


Hs^7373 


Homo sapiens mRNA; cDNA DKFZp5
cleavage and polyadenylation s




3.9 


418181 


U37012


Hs.83727 


CPSF_A,SS,TM 


19 


402793 
418681 


AA287786 


Hs.23449 


Target Exon 

insulin receptor tyrosine kina


SS,TM,cyclin,cyclin_C
SS,SH3 


3 


9 


412621 


L40397 
AW975530 


Hs.74137 
Hs.28355 


transmembrane trafficking prot 
hypothetical protein FLJ22402


EMP24_GP25L,SS,TM 


3 


9 
9 


438483 


AW955735 


Hs.321635 


ESTs, Weakly similar to A46302 


SS,TM,IP_trans






431472 




Hs.256549 


nucleotide binding protein 2 ( 


fer4_NifH,ParA,APS_kinase


; 


9 




ALC8C092 


Hs.19610 


DKFZP564N1362 protein 


SS,TM,SS,TM


3 


8 


436686 


AW450205 


Hs.305890 


BCL2-like 1 


TM,Bcl-2,BH4 


3 


8 








hypothetical protein FLJ22341




3.8 


441196 


BE3978C2 


Hs.7744


NM_007103*:Homo sapiens NADH d


Complex1_51K,SNF2_N,helic






433030 


AW068857 


Hs.279929 


gp25L2 protein 


SS,TM,EMP24_GP25L,SS,TM,G


3.8 




BE515274 


Hs.47062 


polymerase (RNA) II (DNA direc


RNA_pol_M_15KD,SS,COX7a


I 




435049 


AL122067 


Hs.4746 


hypothetical protein FLJ21324


SS,pfkB




i 




AI133461 


Hs.251664 


insulin-like growth factor 2 (


SS,Insulin,Insulin


3.8 


450835 


BE262773 


Hs.25584 


hypothetical protein FLJ10767


ArfGap,SS,vwa,TSPN,fn3,Co






414134 


X60188 


Hs.861 


mitogen-activated protein kina 


pkinase,SS,pkinase,T-box 


3.8 


418090 


U57059 


Hs.83429 


tumor necrosis factor (ligand)


TNF,SS 




8 


448832 






ECSIT 






8 


447256 


AW593008 


Hs.6126


hypothetical protein dJ1141E15
protein tyrosine kinase 2 beta


SS,TM,SS,TM


3.8 


448107 


D45853 


Hs.20313 


FocalJVT,pkinase,SS,Pepe 






426433 


L38969


Hs.169875


thrombospondin 3


TSPN,tsp_3,SS,TM,SEA,TSPN


3.8 


431626 


ALC35681 


Hs.265327 


hypothetical protein DKFZp761l




3.8 


430956 


AI183S29 


Hs.2706 


glutathione peroxidase 4 (phos


GSHPx,SS,TM,ABC_tran


3.8 


450998 


BE387614 


Hs.25797 


splicing factor 3b, subunit 4,


SS,TM,sugar_tr,histone


3.8 


434899 


BE613631 


HS.283S6S 


FOS-like antigen-1
7-dehydrocholesterol reductase


bZIP,SS,bZIP,cofilin_ADF,


3.8


444734 


NM_001360


Hs.11806


ERG4_ERG24,SS,TM


3.8 


411090 


BE1656S0 


Hs.339697 


VPS28 protein 


SS,TM,CPSF_A 


3.8 




AI492175 


Hs.21446 


KIAA1716 protein


SS,DIX,PDZ,DEP,Dishevelle




8 


421339 


AA070224 


HS.103S61 


SRp25 nuclear protein 




3.7 


406535 






Target Exon 


SS,TM,Ribosomal_S19e,ig,l




7 


447281 


AA017018 


Hs.18021 


hypothetical protein FLJ20446


SS,SS,Tektin,Piwi,PAZ


3.7 


433126 


AB021262 


Hs.99816 


beta-catenin-interacting prote


SS,TM


3.7 


425215 


AF030291 


Hs.155165 


zinc finger protein-like 1


PHD,SS,TM,DnaJ,ERG4_ERG24 


3.7 


420536 


AL117456


Hs.275438


histone deacetylase 7A


Hist_deacetyl,SS,Hist_dea




7 


417998 


AW967420 




gb:EST379495 MAGE resequences, 


SS,TM 




7 


430890 


X54232 




glypican 1


Glypican,SS 




7 


427863 


AF189712 


Hs.181002 


MLL septin-like fusion 


SS,GTP_CDC,SS,TM


3.7 




BE613362 






SS,TM 


3.7 




AB032993 


Hs.109929 


likely homolog of rat Gf^P-ass 


TM.ietetra,lon_trans,HUH, 




7 


410293 


AK000047 


Hs.51960 


hypothetical protein 


K_tetra,SS


3.7 








E1B-55kDa-associated protein 5


SPRY,SAP,SS,TM,SPRY,SAP,p


3.7 


423683 


BE388699 


Hs.4188 


hypothetical protein MGC10812


SS,Peptidase_C15,TGF-beta




7 


415697 


AI365603 


Hs.78605 


DKFZP566I1024 protein


SS,cpn60_TCP1


3.7 


415825 


Y18024 


Hs.78877 


inositol 1,4,5-trisphosphate 3


SS




7 


418062 


AA350659 


Hs.83347 


angio-associated, migratory ce


WD40,Bacterial_PQQ,TM,UPF


3.7 


444706 


AK000398 


Hs.11747 


hypothetical protein FLJ20391


SS,SS,TM


3.7 


407381 


AA420659 


Hs.183110 


ESTs, Weakly similar to ALUC_H


SS,TM 


3 


7 


423432 


BE2S2996 


Hs.44067 


ESTs 


m 


3.7 


444982 


AK002182 


Hs.12211 


GDP-fucose transporter 1


SS,TM,DUF6,SS,TM


3.7 


407777 


AA161071 


Hs.71465 




SS,TM,Monooxygenase 


3.7 


422715 


AA332178 


Hs.119403 


hexosaminidase A (alpha polype 
sialidase 1 (lysosomal sialida


Glyco_hydro_20,Glyco_hydr


3.7 


422609 


Z46023 


Hs.118721


SS,TM,BNR,SS,TM,SET,HSP70 


3.7 


414732 


AW410976 


Hs.77152 




IWCM,RIP,SS,zf-C2H2,KRAB 


3.7 


452579 


AA131667 


Hs.23830 


ESTs 


SS,CIOydrolase 


3.7 


419032 


W81330 


Hs.58643 


ESTs, Highly similar to JAK3B


SS,pkinase,SH2,Insulin,pk






411166 


NM_000169 


Hs.69089 


galactosidase, alpha 


Melibiase,Ribosomal_L44,z






444000 


AIC95034 


Hs.135528 


ESTs 


SS,HLH 


; 




441174 


BE312775 


Hs.294005 


Homo sapiens, clone IMAGE:3050


SS,TM 






429491 


NM_012111


Hs.204041


chromosome 14 open reading fra


SS 






433433 


AB018274 


Hs.6214 


KIAA0731 protein








425162 


BE514851 






Carn_acyltransf,Choline_k


; 




429571 




Hs.211594


proteasome (prosome, macropain


AAA.NBAi«;,TIW 


3 




421018 


AI569023 


Hs.129888 


hypothetical protein FLJ14768


zf-C2H2,SS,rm,ENTH 






433604 


NM_013442 


Hs.3439 


stomatin-like 2


Band_7,SS,TM,AAA,cdc48_N, 


; 




451544 


AK000429 


Hs.26570 


hypothetical protein FLJ20422


SS,TM,COX3,SS,TM,SRF-TF


3 


6 


444369 


AV649296 


Hs.282793 


ESTs 








405560 


X65371 


Hs.172550 


polypyrimidine tract binding p


rrm,beta-lactamase,SS,try


3.6 


456503 


AW977779 


Hs.194613 


ESTs 


SS,TM,bromodomain,abhydro 


3.6 




AK000461


Hs.26890 


cat eye syndrome chromosome re 


SS,SS,TM,A_deaminase 


3.6 


425394 


AA356730 


HSJ23949 


kangai 1 (suppression of tumor 


SS,TM,transmembrane4






428011 


BE387S14 


Hs.181418 


KIAA0152 gene product 


Acyl-CoA_dh,SS,efhand 
SS 


3.6 


407627 


AI419a20 


Hs.62620 


chromosome 6 open reading fram


3.6 


436437 


F1220a 


Hs.5811 


chromosome 21 open reading fra
tuberous sclerosis 2


SS,Syja_N,Exo_endo_phos






419418 


X75621 


Hs.90303 


Rap_GAP,Tuberin,Peptidase






440300 


N39750 


Hs.8859 


Homo sapiens, Similar to RIKEN


SS 






448135 


AA036680 


Hs.20447 


protein kinase related to S.c 


pkinase,PBD 






435977 


AL138079 


Hs.5012 


brain-specific membrane-anchor


SS,TM,SS,TM,ubiquitin,Rib






419095 


AA234009 


Hs.188715 


ESTs 


pkinase,PH,pkinase_C






447267 


AL360143 


Hs.17936 


DKFZP434H132 protein 




3.6 


418054 


NM_002318


Hs.83354


lysyl oxidase-like 2


SRCR,Lysyl_oxidase,SS,TM,


U 


444354 


AA847582 


Hs.10927 


hypothetical prolan 103729.1 


3.6 



413097
427579 
409154 



449030 
426542 
439873 
428960 
421564 
441094 
460007 
422898 
444914 
420178 
418984 
414166 



430053 
458687 
424679 



422051 
417230 



432647 
421310 
420999 
409561 
419727 
421267 
411501 
448741 
407103 
422808 



BE616971 
AW005054 
NM_005632



NM_003654 

AI80S664 

BE245274 

AA333387 
M604520 
AF052155 
AW024315 



AI251317 
BE076537 
BE258946 
AW327546 



451714 
410633 
410609 
414775 
428495 



Hs.75196 
Hs.179669 
HS.S0842 
Hs.172084 
Hs.11170 
Hs.53066 
Hs.10iai 
Hs.279881 
Hs.118552

Hs.5898 

Hs.57100 

Hs.170310 

Hs.300638 

Hs.194673 

Hs.105350 

Hs.7647 

Hs.24301 

Hs.127401

Hs.12142 

Hs.95659 

Hs.75789 
Hs.57687 
Hs.104576 
Hs.31731 
Hs.180428 

Hs.82916 



Hs.173611 
Hs.111024
Hs.224571 
HS.B1728 



F11327 

AI360119.comp 

BE410228 

AW250533 

AA622968 

AK001459 

AU076725 

AW134756 

AW750535 

AI807481 

AW630087 

AA338903 

U58048 

AW160796 

BE314724 

AB002368 

AA424881 

AA449014 

N95657 

H61851 

M16714 

AI867781 

AW977263 

AW327785 

AW95g486 

AW500508 

BE516501 

AK001301 

AA912347 

AW960504 

AA293279 

AK000344 

BE546789 

BE29B441 

AA992036 

NM_013279 

NM_005341 



Hs.157406
Hs.181013 
Hs.75410 
Hs.75139 
Hs.71574 
Hs.90375 
Hs.101408 
Hs.192477 

Hs^581 
Hs.103315 
Hs.100915 
Hs.183138 
Hs.92700 
Hs.103081 
Hs.70500 
Hs.19574 
Hs.256301 
Hs.121025 



Hs.173421 
Hs.21732 
Hs.75102 
Hs.32343 



Hs.346742 
Hs.287361
Hs.172702 
Hs.184640 
Hs.2364 



enoyl Coenzyme A hydratase 1,
Homo sapiens, Similar to DNA s
ESTs, Weakly similar to KCC1_H
small optic lobes (Drosophila)
ankyrin repeat-containing prot
hypothetical protein FLJ20637
interferon-induced protein 35
Homo sapiens, clone IMAGE:3627
RNA binding motif protein 14



hypothetical protein FLJ12875
mannosidase, alpha, class IB,
hypothetical protein FLJ20539



phosphoprotein enriched in ast 
KIAA0404 protein 
MYC-associated zinc finger pro
polymerase (RNA) II (DNA direc
DKFZP434A163 protein; selectiv
WD repeat domain 13
lethal giant larvae (Drosophil
ribosomal protein L18
N-myc downstream regulated
four and a half LIM domains 3
carbohydrate (keratan sulfate



KIAA1181 protein
Target Exon 

chaperonin containing TCP1, su 
ESTs, Moderately similar to AL 
SEC13 (S. cerevisiae)-like 1
GLUT4 enhancer factor
DKFZP727G051 protein
hypothetical protein similar t 
ESTs 

ubiquitin-conjugating enzyme E
Target CAT

solute carrier family 25 (mito

ESTs

unc119 (C.elegans) homolog
hypothetical protein FLJ11137
tetracycline transporter-like 
gb:HSC2VD101 normalized infant 



heat shock 70kD protein 5 (glu 
partner of RAC1 (arfaptin 2)
hypothetical protein FLJ14926
hypothetical protein FU1CS97



Homo sapiens cDNA:FU23331fi 
fibroblast growth factor recep 
trinucleotide repeat containin 



procollagen (type III) N-en 
DKFZP5640243 protein 
ribosomal protein S6 kinase, 7
KIAA0370 protein
hypothetical protein MGC5469
hypothetical protein MGC1317C
chromosome 11 open reading fra
ESTs, Moderately similar to YO 
gb:yr80e10.r1Soai^ fetal liv 

HT014 

general transcription factor I
KIAA1564 protein
ESTs

alanyl-tRNA synthetase
Homo sapiens, Similar to RIKEN
hypothetical protein FLJ10439
ESTs, Weakly similar to JC5314
FE65-LIKE 2

hypothetical protein FU20S1S
hypothetical protein FLJ20337
hypothetical protein MGC3260
ADP-ribosylation factor relate
ESTs, Weakly similar to (defli
hypothetical protein MGC10781
GLI-Kruppel family member HKR3




ank,SET,SS,TVI,pkinase,SH2 
HECT.SS.HECT 
SS,ras,RibosomaLL27e,KOW 
SS,PID,SH2 

[mi,SS,spectnn,PH,nmi,so 



TM,UI21,LipoproteinJ,GBP 
Mn 

se,SS,TM,Hydrola 



WD40,SS.TBCrrm 

WD40,SS,TM 

SS,TM 

l«lr,abhydrolase,SS 

UM,SS 

SS 

AhpC-TSA,SS,hormonejec,z 

TM,SS,TM,KOW 

SS,tubulin,SS 

opn60.TCP1,SS,cpn60.TCP1 
SS,UCH-2,UCH-1 
WD40,SS,TM,E1-E2_ATPase,C 
SS 

SS,TM,GDA1 CD39,GDA1_CD39 

SS,TM,GYF,actin,PA 

UC)jcan,SS,TM,AmiadlDo.^e 

coiiiplex1_49Kd,SS,TM,ITAM, 

SS;mito_carr,SS,mito_carr 

SS 

SS,glycalyti(Lenzy 

SS,zf-C2H2,SCAN,TRIS,SS 

SS,TM,SS,TM 

SS 

PGAM,SS,TM,ldh 

SS,HSP70,homeobox,Hydanto 

SS.hemapexin,Filamin,NHU 



PEP-utilizers,PEP-uUlize 

amlnolran-4,TM 

SS,Exonuclease,zf-C2H2 



Herpes_env,SS,TM,Peptidas 

pklnase,pkinase_C,SS 

SS,TM,SS,TM 



SS,TM,tiomeobox,LIM 

MHCJ,ig,SS,TM 

SS,abtiydrolase_2 

SS.PeptIdase_M24 
SS,zf-C3HC4,SPRY 
DHHA1,SS,tRNA-syntJc,DHH 
SS 

WD40,SS,TM,Synta)iin,Synta 
SS 

SS,TM 

DSPc,SSJm]CF-box 

8S,TBC,FHA^f-C3HC4 

SS,TM 
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446618 
444868 
430041 

416950 
431203 
432714 
415674 
426162 
418440 



453856 
430137 
446427 
400747 
445580 
452568 



429460 
416448 
433038 
440261 
412922 
432941 
441244 
438175 
423024 
430120 
419571 
413019 



422242 
430480 
452438 



409197 
417896 
418026 
409057 



425080 
445363 
421943 



413891 
449455 
419193 
406701 



443402 
432416 
429662 
429160 
427729 
418151 
443250 
431158 
414292 



AL110307 
BE660471 
AW247237 
AL049798 
AW248421 



AW796016 

M60721 

W04803 

BE612935 

AI376727 

AW67529e 

AW674962 

BE281604 

X07730 

BE263901 

AL039734 

AJ251760 

AL079399 

BE514230 

AA431633 

AL049709 

XI 2458 



BE379727 

AA702305 

W91976 

AW967107 

AI393498 

NIIL005993 



T60748 
D29643 
AA780613 

U52427 

AF279142 

AL137255 

AI199575 

U24578 

T08566 

U77846 

BE410937 

AI929701 

AB033100 

AA854238.oonip 

NM_016034 

AW859138 

BE3B8407 



Hs.15591 

Hs.12101 

Hs.227835 

Hs.80552 

Hs.250758 

Hs.278675 

Hs.73596 

Hs.1 67246 

Hs.85119 

Hs.64310 

Hs.54467 

Hs.9006 

Hs.19447 



Hs.86131 

Hs.203238 
Hs.79339 
Hs.279939 
Hs.332012 
Hs.74870 
Hs.279851 
Hs.184052 
Hs.122110 



Hs.273385 
Hs!29595 



Hs.303025 
HS.B2390 
Hs.83213 
Hs.130060 



Hs.12570 
Hs.343912 
Hs.9235 
Hs.89434 
Hs.72241 

Hs.278408 
Hs.34789 
Hs.62954 
Hs.91681 
Hs.14839 



427584 
419069 
431717 
448381 
419394 
436240 
413900 
417920 
421819 



AW409747 

S47833 

NM_0134O3 



Hs.6945 
HS.21C36 
H5.gC232 
HS.5C86 
Hs.75612 
Hs.82g27 
HS.10866S 



PDZ-LIM protein mystique



ESTs 
Target Exon 

skb1 (S. pombe) homolog
Homo sapiens mRNA; cDNA DKFZpS
Fas (TNFRSF6)-associated via d
Target Exon

phosphodiesterase 1B, calmodul
lectin, galactoside-binding, s



Homo sapiens, clone IMAGE:3687
H2.0 (Drosophila)-like homeob
hypothetical protein FLJ10241
PP1201 protein
ESTs

ESTs, Moderately similar to AL

hypothetical protein FLJ11350
protein kinase D2
low density lipoprotein-relate
kallikrein 3, (prostate specif
ESTs, Weakly similar to S37431
nardilysin (N-arginine dibasic
guanine nucleotide binding pro 
DKFZP586F1524 protein 
JM4 protein 

NM_002488*:Homo sapiens NADH d 
Human DNA sequence from clone 
Protein P3 

chromosome 11 open reading fra
defender against cell death 1 
fatty acid binding protein 4, 



hypothetical protein MGC4365 
inositol 1,4,5-triphosphate re 
tubulin-specific chaperone d
Homo sapiens, Similar to RIKEN



ferritin, heavy polypeptide 1
ESTs, Weakly similar to DCHUO
polymerase (RNA) II (DNA direc



hypothetical protein FLJ22693



ESTs 



Tax interaction protein 1 
elastin (supravalvular aortic 
emerin (Emery-Dreifuss muscula
phosphoinositide-3-kinase, reg
smoothened (Drosophila) homolo
KIAA1274 protein (similar to m
actin related protein 2/3 comp
mitochondrial ribosomal protei
Homo sapiens cDNA: FLJ22288 fi



Target Exon
hypothetical protein FLJ22427
v-myb avian myeloblastosis vir
ESTs, Weakly similar to CA13_H
mitochondrial ribosomal protei
Homo sapiens mRNA; cDNA DKFZp4
KIAA0552 gene product
hypothetical protein MGC10433



li*w34,SS7f-C2H2.SCAN 

ss,pa 

SaTM,7tmJ,tubulin 
SS 

AAA,Sigma54_activat,SS,TP 

bPomodomain,SS,TM,SNF2_N, 

SS,proteasome,SS,TM,Cadhe 

flavodoxin,FAD_binding,SS 

ubiquitin,SS,UQ_con 

ig,fn3,SS,TfH,GalP_UDP tra 



TM,MSP domain 
UM,SS,SH3,Sorb 
SS,SH3,PID.SS,PID 



death,DED,SS,TM 
SS 

PDEase,SS,PDEase 

SRCR,SS,TM 

TM,mito carr.TM 

SS,TM,SS,TM,IRK 

SS,homeobax,SS 

SS,RNase_PH.RNase_PK.C 

8S,TM,WD4C 

SS,TM,trypan,kringle,fn2 

SS,TM,CD35,CD36 

pkinase,DAG_PE-bind,PH,DC 

lrypsln,SS,trypsin,trypsi 
SS.TM 

Peptidase_M16,HC03_cotran 

G-alpha.arf,SS,G- alpha 

SS,TM,hemopexin,Somatomed 

SS,TM,KOW,HLH 

SS,tRNA-synL2b,WHEP-TRS, 

TM 



DAD,SS,TM 

lipocaliaSS.IIpocalln 

SS,TGFb_propeptide,TGF-be 

SS,TM,SH3,2f-C3HC4 

SS,TM 

SS,CTF_NFI 

ATP-6ynLB,HEAT_PBS,SS,TM 
SaTM.S8,TM 
NDK,SS,adhj5hort,NDK 
colilinJU)F,88.colilirbAD 



SS,TM,DIX)ST_48kD,VP7,SS,T 
S8,TIM,UDPGT 
SS,tRNA^ynL1b,tRNAubind 
S1,SS 

S8,TM,EGF,leotin.c,sushi, 

SS,hormone_rec,zf-CCCH 

SS,TM,OKysterol_BP 

SS,A2M,NTR,A2M_N,prenyltr 

PDZ,SS,TM,P2X_receptor,FG 

SS,PDZ,UM,pkinase 

LEM.SS.Ri bosom aLL10e,Acy 

SH2,SH3,RhoGAP,SS,GILT,SH 

SS,TM,Fz,Frizzled,7tm_2,S 

RhoGEF,REV,PH,SS,TM,Ribos 

Ribosomal_S2,SS,lipooa!in 

SS,ExQnuclease 

UCLcon,SS,TM,SAHU'NT 

88,TM,7tmJ,88,TM,7tm_2,G 

88,TM.Surp,ublqutlin.TBC 

NA.SS 



SS.TM,Ets,C0X6B,transmemb 

TPR,SS,TM,I>iaJ 

A_deaminase,SS,G-alpha,GS 



wo 02/102235 



PCT/US02/19297 



408917 AW249025 

443099 A1372836 

427022 AW245e39 

452711 AW967047 

407236 W79485 

452537 AW247390 

452139 AA099969 

447529 AF034790 
401097 

452736 coil 64 

435507 AI143679 

424934 U75370 

413245 BE244334 



431677 AK000496 



401772 

453754 AW972580 

423865 H05202 

450962 BE635647 

441954 AI744935 

412787 D87452 

422034 AC006486 

450788 AI738410 

452511 BE4C8178 

414380 BE391815 

407597 AA043925 

434955 BE276128 

435632 AF220049 

432465 056165 

43C526 AFiai862 

453412 AJ003290 



447322 BE617649 

422268 N25485 

419578 AF064853 

446929 AA076132 



Hs.169474 

Hs.7768 

Hs.9003 

Hs.173255 

Hs.293224 

Hs.173980 

Hs.77735 

Hs.16331 



Hs.26510 
Hs.163880 
Hs.75249 



Hs.52788 

Hs.74617 

Hs.1342 

Hs.275924 

Hs.97871 

HS.18S87 
Hs.100071 
HS.1264S1 
Hs.21432 



Hs.25723 
Hs.8047 
Hs.74579 



Hs.285165 
Hs.75981 
Hs.339352 



H3.31854 
Hs.77690 
Hs.330310 
Hs.91299 
Hs.9460 



DKFZP586J0119 protein



hypothetical protein FLJ13868



ESTs, Weakly similar to T00375

nuclear matrix protein NMP200
hypothetical protein FLJ11618
Homo sapiens cDNA: FLJ21482 fi
translocase of inner mitochond
C12000858*:gi|7363437|ref|NP_0
Homo sapiens PAC clone RP1-130
vacuolar protein sorting 33B (
polymerase (RNA) mitochondria
ADP-ribosylation factor-like 6



BCS1 (yeast homolog)-like
hypothetical protein FLJ20489
hypothetical protein FLJ14596
fragile X mental retardation,
dynactin 1 (p150, Glued (Droso
cytochrome c oxidase subunit V
dystrophia myotonica-containin
Homo sapiens, clone IMAGE:3B45

tubulin, beta 5

KIAA1588 protein
6-phosphogluconolactonase
ESTs, Weakly similar to A46302
SEX gene

NHL01452aHomo sapiens MYB bin 

ESTs 

FGF receptor activating protei



Homo sapiens cDNA FU2084S lis
ubiquitin specific protease 14
Homo sapiens brother of CDO (B



uncharacterized hematopoietic
non-metastatic cells 2, protei
G protein-coupled receptor, fa



ne 14 open reading fra
K562 cell-derived leucine-zipp
RAB5B, member RAS oncogene fam



im,SS,rnn,SH3,ras,20G-Fe 



Tim17,SS,TM,pk!nase,0TU 

SS,TM,7tmJ,SS 

SS,SS,TM,TBC,Surp,ubiquit 

PPR,SS,TM,cNUP_binding,RN 
SS,TM,kazal,RibosomaLS8, 
SS,SS,TM,B56 
AAA,SS,PI-PLC-X,PH,PI-PLC 

SS,TM,PTPA 

KH-doraain,SS,TW,HMG_box 
CAP_GLY,SS 



GlucosamlneJsaSS 
SS 

SS.TM,Sema.TIG,PSl,GDI 

SS,TM,tas,RlbosoniaLS19,T 

SSJM 

SS,TI« 

TPR,SS,TM,AAA,cdc48_N,Ban 
zf-CCCH,SSTM,NTP_lransfe 
Ets,SS,pkinase,PAF-AHJb 
SS,TM 

SS,thiored,P5CR 

UCH-2,UCH-1,ublquitln,SS 

SS,TM,SS,TM 

SS.UCLcon 
NDK,SS,NDK 
7tm_3,homeobox,SS,TM 



Gia,SS,W(]4a 

WD40.SS,EPO_TPO 

SS,TIUI,WD40 



TABLE 21B:

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



411674 
413052 



419069 
419250 



181650L3 
183289J 



BE614290 AA307674 N35629 AA338S38 AI193603 AA7810g6 AI6B0061 AI613258 AW276647 BE221263 AI348910 AI985C31 AI0g0078 AI359617 
AA666391 AI160210 A1446461 AI355345AI343638AI343640AI275091 M78746AW262795 AW250002 AA503755AI934519 AW272086 N26520 

AA626639 

AW974648 AA652153 AA649671 AA078582 

AW8611 23 AW861 125 AW856717 AW861 1 1 6 AW856706 AW856788 AW856774 AW856787 AW856780 AW856782 AW856789 AW856772 
AW856784 AW856786 AW856776 AW856635 AW856767 

BE249841 BE062657 BE062771 BE06263S BE062813 BE0S2699 BE062895 BE062747 BB)62719 BE293541 

AW163526 AW163255 AW1 63385 AI929359 BE279279AA1 32590 AW157329 AA584408 AW157252 AI692198 AW003514 T24436 AI765658 
AW1 57459 AI810740 AI659582 AI969924 AI929284 AI340993 AI349083 AW299522 AW66465C AW299513 AA132529 AI340991 AI912836 

AI341293AI650609AA279 

BE271 020 AI763358 AI926430 AI806151 AW003726 T15590 AA649945 AW12991 1 AI570748 T57492 AA828002 AW237602 AW003639 AI13904S 
AI950958 BE042625 AW778973 AI287a59 AI9B3931 AW515101 AW1 50029 AI358496AI621173AA846016 AI470921 AW169748 AI991000 

AW513748AI04058 

BE243628 BE246081 BE247016 BE241984 BE241534 BE246091 BE245679 BE243620 BE24S998 BE242329 BE241417 BE241457 BE242522 

BE24ig89BE241464 

i«1851H74099 T67099 

AW967420 AA210915AA236991 AA210916 

AA421401 T49326 AA330666 AA328941 W63573 AA75a023 AA976306 H52254 AA877107 BE207784 AW664584 AI924890 AA458586 AI422142 
AI891097 AI811174 R69866 T49327 AA233722 AA631 138 AA910314 AI379416 AI129321 AA861574 AA635649 AI339443 AW009533 AA677036 
AA948287AA62 

AA233801 BE383487 AA913939 AI632681 AI813277 AI373652 AW134802 AI863574 AW305364 AI858557 AI670746 AI01 5036 AI9353a4 AI935317 
AW138668 AW204971 AI765223AA884146 AA973341 AA234062 

AWT70185 AW296271 H11254 AW403510 AI032786 AA767046 AI376115 AI582209AA460965 AI868663 AI016900 R05715 Al 127382 AI660953 
AI023644 H00465 AW959578 AA815039 AW292253 R05714 AAB15462 AA235654 AA461274 W24933 AA300091 H0051 5 



423696 23112.1 Z92546 AA330586 AI570568 AW341487 AI827050 AW298668 AI7921B9 AI015693 AI733599 AI572251 AI672'188 AW193262 AI244716 AI864375 

AI206100 AA912444 AI269365 AI6402S4 AW772466 AIBS7336 AA627604 H16914 AA358477 AA338009 
426080 246559 1 AI393498 R42314 AI088818 AI696468 AI418641 AA573152 F08817 AI910796 AW338984 R39024 AA729145 BE245966 AI0g3722 AA541730 

F09835 At242755 AA350447 AA865667 T93903 AW081029 AA493711 AA650030 N35995 N21491 T57002 Z25379 AI906851 
427239 27647 1 BE270447 AW409921 BE207288 BE207170 D56355 BE263223 BE408171 BE262243 BE392439 BE292738 eE261776 BE314300 BE267719 

BE268715 BE513876 BE295291 BE297065 AA21(I923 BE407519 H51344 BE622905 AW248281 AW250313 T19021 AA355115 AA316879 

BE259533 BE621936 AA290724 

427391 27815J W60675AK001212AA165752AA878366AA090872AB033013AW249107AA031890AA112820AW366388N66166AA326766AW962294 
AA130320 C03570 C04358 W60676 AW248674 AA034989 AA044781 AA074274 H2S212 AI800572 AI127583 AI951785 AA356557 AI571746 
H23835AI589543AI215670 

AA421182 AI734104 AI733923 AA430600 
AI824164 AI676005 AW12961 2 AI825903 AA773987 AI823645 AI823860 AA456229 A1824295 AA454622 AI264049 AI090237 AI669787 AI804012 
AI306153 W96164 AI298273 AW884073 AW883986 
430168 313927_1 AW968343 AA468507 AI478223 AW513008 AI762122 AI554512 AA862642 AA468976 
433519 3688Q1 2 BE263901 AA5960B6Alig0276AI094806AI8312SO AIS72668AW204652AIS60600AI922941 R49621 

438707 46360 1 L08239 BE618914 AW385394AW385398 AW385401 AI922683 AA907337AA160504AA928142AA6019S9AA010594BE618528AA160SS1 
440191 48804J AI990417 AI304400 AI193071 AI742483 AW003408 AW131566 AI400201 AI656740 AI309186 AW665173 AW204722 AI215122 AI200785 

BE467373 AI147599 A1215120 AI076110 AI803429 AI262491 AI808243 AI281007 AW135212 AW205103 AI754349 AI004801 AI051273 AW768918 



448606 77159.1 



AA457456 AA907921 AI567715 AA579472 T642) 6 AA373128 F35533 AA7221 13 T64403 AA653738 F28806 AA5956B9 AA047537 AA022499 
AW440632 F36782 AI554160 AI183767 A1805052 AA160379 AA481678 AI186031 AI148988 AI174482 AA868833 AI674395 AA481440 AI914985 
AI698771AA44 

BE246743 AA436942 AVV024744 AW242177 AA975476 AW385185 R07536 R73462 AV654529 T57442 AI399986 R50073 R48743 AI7e9689 
AI863006 AA317806 AI678000 AW189963 Aig86207 AW471273 R73463 AI335104 AI590161 AI469257 AI954604 H21954 T25141 AA856793 
R50074AI70B253AI2 

BE613362AA447862 H72036AA393664AI681334AW139128AA932S79AI302241AI936800AVV960628AI492148(»6192AA336107AA80800^ 
AW615212 BE297403 BE298S78 AI187207AA9286S5 AI620631 AA938128 AI346527AI040261 AA808401 AW130326AI440313 AA868693 



448677 
450788 
452160 



467022 



775217.1 
846840.1 
9Q1991_1 



274445.1 



AI56b769A1857497AW151454 
AI738410AW016906AI971725 
BE378541 AI863051 
AJ003290 AJ003238 AW276947 
T70192 BE147696 
BE170313BE158339BE158290 

AW37726B BE067458 BE067611 BE067515 BE067467 BE067614 AA397442 



TABLE 21C:

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham, et al. (1999) Nature 402:489-495

Strand: Indicates DNA strand from which exons were predicted
Nt_position: Indicates nucleotide positions of predicted exons



400460 
400663 
400747 
400846 
401097 
401128 
401655 
401727 
401751 
401772 
402365 



405325 
405356 
405496 
406101 
406307 



of predicted ex( 
Strand Nt_position



9828651 Plus 



6136940 Minus 

7406502 Minus 

7670577 Minus 

8440025 Minus 

9367203 Plus 



9124019 Plus 



81941-82434 

71249-71441 

39310^474 

60356-61096 

37349J7885 

79556«)132 

5434i«482 

139165-139322 

183917-1B4042 

70928-71185 

3818-8952 

11824-12090,14290-14544 
69012-69165 
361474,541-637 
114150-114272 



145931-147795 



36116-36276 

147706-148052 

125325-125831 



Table 22A lists about 811 genes up-regulated in ovarian cancer compared to normal adult tissues that are likely to encode extracellular or cell-surface proteins. These were
selected as for Table 20A, except that the ratio of "average" ovarian cancer to "average" normal adult tissues was greater than or equal to 4.0, the "average" ovarian cancer level
was set to the 96th percentile value amongst various ovarian cancer specimens, the "average" normal adult tissue level was set to the 76th percentile value amongst various
non-malignant tissues, the "average" ovarian cancer value was greater than or equal to 80 units, and the predicted protein contained a structural domain that is indicative of enzymatic
function or of transducing an intracellular signal, or of being modulatable by small molecules (e.g., pkinase, peptidase, phosphatase, or ion_transporter). Predicted protein



TABLE 22A:

Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number
Unigene Title: Unigene gene title
Protein Dom.: Predicted protein domain
R1: Ratio of tumor to normal body tissue
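
The selection criteria stated for Table 22A can be illustrated with a short sketch. This is not the procedure actually used to build the table: the percentile method (linear interpolation), the `Probeset` record, the function names, and the set of domain names are assumptions made for illustration, and the thresholds are simply the values as they appear in the description above (percentiles of 96 and 76, a ratio of at least 4.0, and a tumor level of at least 80 units).

```python
from dataclasses import dataclass
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation (assumed; the text names no method)."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("at least one expression value is required")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


@dataclass
class Probeset:
    """Hypothetical record for one probeset (one Pkey row)."""
    pkey: int
    tumor_levels: Sequence[float]    # expression in ovarian cancer specimens
    normal_levels: Sequence[float]   # expression in non-malignant adult tissues
    domains: Sequence[str]           # predicted protein domains ("Protein Dom.")


# Illustrative domain names taken from the "e.g." list in the description.
SELECTED_DOMAINS = {"pkinase", "peptidase", "phosphatase", "ion_transporter"}


def passes_table_22a_filter(
    probeset: Probeset,
    tumor_pct: float = 96.0,
    normal_pct: float = 76.0,
    min_ratio: float = 4.0,
    min_tumor_level: float = 80.0,
) -> bool:
    """Apply the ratio, level, and domain criteria to one probeset."""
    tumor = percentile(probeset.tumor_levels, tumor_pct)
    normal = percentile(probeset.normal_levels, normal_pct)
    if tumor < min_tumor_level:
        return False
    # Written as a product so a zero normal level needs no division (assumption).
    if tumor < min_ratio * normal:
        return False
    return any(domain in SELECTED_DOMAINS for domain in probeset.domains)
```

A probeset whose tumor level clears both thresholds but whose predicted domains are only, say, SS and TM would be rejected by this sketch, which mirrors the requirement that the predicted protein contain one of the listed kinds of structural domain.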




Hs.118890 

HS.68Q61 
Hs.1 73724 
Hs.15977 
NM_002419 Hs.89 

Hs.89631 
BE395875 Hs.279609 
BE276760 Hs.30928 
■ 173125 



017629 



S.1 59479 



447595 
422708 
414837 
429712 



421612 
421363 
442739 
420568 
421445 



429183 
444664 
427640 
425123 
416006 
412942 



442520 
458130 
449936 



426108 
418133 
418736 



NrA.002496 Hs.90443 

U40343 H3.29656 

AW379130 Hs,18953 

AB017430 Hs.1 19324 

U24266 Hs.77448 

AW245825 Hs,211914 

BE2427C9 Hs.1 59637 

M64437 Hs.234799 

AK002194 Hs.19851 

AW894667 Hs.169965 

AF1612S4 HS.1061S6 

NM.001381 Hs.103854 

NIIL007274 Hs.8679 

F09247 Hs.247735 

AA9130Sg Hs.104433 

NII4.0049S4 HS.15719S 

Nlit.013272 Hs.14805 

11481105 HS.14655C 

A1^8637 Hs.196352 

AB014604 Hs.197955 

N26362 Hs.11615 

AF058293 Hs.180015 

AW2C5274 Hs.1 54695 

AA324251 Hs.78950 

AL120344 Hs.75074 

Z80345 Hs.1 27610 

AW161050 Hs.1 69611 

BE258876 Hs!94446 

AF234887 Hs.57652 

AL050341 Hs.37165 
A1287878 

C00138 Hs.3535 

AA11S811 Hs.6838 

AA938293 Hs.60088 

AA852431 Hs.51299 

NHt.015071 Hs.132942 

AK001S02 HS.917S3 

AA853479 Hs.89890 

AA622037 Hs.166468 

AA723157 Hs.73769 

T18979 Hs.87908 

NIUL002212 Hs.5215 

NHL012152 HS.25B583 

M64590 Hs.27 



Unigene Title 

gb:ywC3b12.s1 Soares melanocyt 
CGI-69 protein 

transmembrane protease, serine
pleckstrin homology-like domai
hypothetical protein FLJ14996
hypothetical protein FLJ20258
RecQ protein-like 4
pyridoxal (pyridoxine, vitamin
spectrin, beta, non-erythrocyt
pim-2 oncogene
bicarbonate transporter relate
phosphatidic acid phosphatase
arachidonate 5-lipoxygenase
prostaglandin E synthase
calpain 4, small subunit (30K
FK506-binding protein 2 (13kD)



inositol 1,3,4-triphosphate 5/
phosphorylase kinase, gamma 2
novel SH2-containing protein 1
glycogen synthase kinase 3 alp
sphingosine kinase 1
creatine kinase, brain

mitogen-activated protein kina
ryanodine receptor 1 (skeletal
mitochondrial carrier homolog
DNA segment on chromosome 19 (
peptidylprolyl iso



galactosamine (N-acetyl)-6-sul
Target CAT

cyclin-dependent kinase inhibi
phosphodiesterase 9A

aldehyde dehydrogenase 4 famil
ENSP00000233627*:NADH-ubiquin
valyl-tRNA synthetase 2
breakpoint cluster region



806 antigen 

docking protein 1, 62kD (downs

cytosolic acyl coenzyme A thio

protocadherin alpha 10

Homo sapiens, clone IMAGE:4054

ELKL motif kinase

solute carrier family 21 (orga

myosin, heavy polypeptide 9, n

neutrophil cytosolic factor 4

KIAA0704 protein

map kinase phosphatase-like pr



pklnase 
DAGKc 

ATP-gua.Ptrans,ATP-8ua.Pt 

SH3,pkinase,pyrklaxal deC 
RYDR.ITPf?,RyR,SPRY.kjn_tr 

EuKjiorin 
projsomerase 
HC03.colransp 
Sulfatasa 

EGF,ig,tectln_e,sushl,Xli 



GST_C,GST_N,Trapomyosln 
RhoGEF,RhoGAP,PH,C2 

DAG_PE-blnd,RhoGAP 

IdljBcepLa 

PH,IRS 

Acyl-CoAJiydro 



phosphomannomutase 2 
branched chain keto acid dehyd
mitogen-activated protein kina
acyl-Coenzyme A dehydrogenase,
second mitochondria-derived ac
cytochrome P450, subfamily XXV
polyamine-modulated factor 1
cadherin, EGF LAG seven-pass G
collagen, type IX, alpha 2
gb:qv23ia6.x1 Na_CGAP_Lym6 Ho
Homo sapiens mRNA for KIAA1668
ras homolog gene family, membe
hypothetical protein MGC11314
NM_021074:Homo sapiens NADH de
GTPase regulator associated wi



PX,SH3,0PR 

PH.OxysleroLBP 

DSPc,Rhodanese 

MIF,late_protein_L2 

PMM 

Eljehydrog 

Acyl-CoA_dh,Acyl-CoAJh_M 
p450 

aldo_ketjed 

7tnu2,EGF,cadherin,lamlnl 
7tm_1 I 



pyruvate carboxylase 



folate receptor 1 (adult) 
Sn12-ielatedCBPaclivatcrpro 
Integrin beta 4 binding protei 



CPSas^l^chaln,PYC_OADA,H 

DUF122 

Folatejsc 

helicase.C,ATJiool< 



glycine dehydrogenase (decarbo 
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45 
50 



444855 
423464 
450787 



414757 
428693 
432370 

428782 
425999 

' 410720 
407143 
421321 
425251 
431354 
420421 
416714 
427336 
409799 
435319 
400748 
428948 
401215 
401281 
427397 
453496 



BE409261 Hs,12084 

NM_016240 HS.1288S6 

AB006190 Hs.25475 

AW410063 Hs.184877 

AF281134 Hs,283741 

AA297356 Hs.80324 

AK000596 Hs.3618 

U76666 Hs.69949 

NM_002962 Hs.296C 

AA1 14050 Hs.19949 

BE246449 Hs.2167 

AA436187 Hs.172631 

NIUL014835 Hs.15519 



U46922 Hs.77252 
AW207440 Hs.1 85973 
AA308334 Hs.274424 



AF035154 Hs.65756 

C14076 Hs.332329 

NM.005309 Hs.103502 

Z22521 Hs.1 55342 

BE046956 Hs.251673 

AF281133 Hs.343589 

AF283770 Hs.79630 

NM.005658 Hs.2134 

D11928 Hs.76845 

H90727 Hs.5123 



431157 
422616 
406779 



BE514362 



AI929685 Hs.177656 

AA442103 Hs.33084 

AF231023 Hs.55173 

NM_001975 Hs.1 46580 

AW401864 Hs.18720 

AF037062 Hs.172914 

R36664 Hs.167257 

BE379936 Hs.28866 

AB023186 Hs.241161 

AK000790 Hs,246886 

BE281240 Hs.9857 
AW295871 

BE383676 Hs.334 

AK001383 Hs.1 16385 

NltiL0CC020 Hs.172670 

AI823g69 Hs.132678 

BE300330 Hs.1 18725 

AA412048 Hs,279574 
AL135841 



4C3213 
412158 
423673 
403949 
457670 
418416 
419594 
422765 



413431 
415200 
414874 
423524 
457558 



BE241740 HS.7S5 



AF119666 Hs.23449 

U11700 HS.B4999 

AA013051 Hs,91417 

AW409701 HS.157B 

AW028733 Hs.31439 

U51333 Hs.159237 

AB037108 HS.641B 

AL031685 HS.127BS 

U28694 Hs.158324 

AW246428 HS.7S3S5 

AL040328 Hs.78202 

D26351 Hs.77515 

AF0SS989 Hs.12g738 

AF083955 Hs.279852 




integrin, alpha M (complement
oxysterol-binding protein-rela
enolase 3, (beta, muscle)
mitotic spindle coiled-coil re
fragile histidine triad gene
degenerative spermatocyte (hom
N-acetylneuraminic acid phosph
C15001413*:gi|10645199|ref|NP_
interleukin 6 receptor
ESTs, Weakly similar to 138022
collagen, type VIII, alpha 1
regulator of G-protein signall



DNA (cytosine-5-)-methyltransf
exosome component Rrp41
CD79A antigen (immunoglobulin-
TNF receptor-associated factor
phosphoserine phosphatase-like
inorganic pyrophosphatase
NM_022122:Homo sapiens matrix
FK506-binding protein 3 (25kD)
C12000457*:gi|7512178|pir|rr30
DKFZP686N2124 protein



solute carrier family 2 (fadi 
cadherin, E6F LAG seven-pass G 
enolase 2, (gamma, neuronal) 
programmed cell death 8 (apopt 
relinol dehydrogenase 5 (11-ci 
NM_005569*;Homo sapiens LIM dc 




ATPase,CuUa 
topoisomerase(DNA) II binding 
bacuiovlral lAP repeatcontah 



Target Bran 
KIAA0939 protein 
chemoWne (C-C moHf) receptor 
ubkiuiUn-conjugatIng enzyme E 
SWI/SNF related, matrix associ 



ICE_p20,DED,iCE plO 
WH1,PBD,WH2 
FG-GAP 
OxystenoLBP 



pkinase.DAGJ'E-bind.pkina 
PWWP,PHD 

RNaseJ'H,RNase_PH_C 

ig,ITAM,Zn_clus 

MATH 

Hydrolase 

Pyrophosphatase 



71ra_2,(;adherin,GPS,lar 

pyr_redox 
adh_short 
pkinase,UM,P[)Z 



Rho guanine nucleotide exchang
NM_003105*:Homo sapiens sortil
hypothetical protein FLJ10521
NM_001895*:Homo sapiens casein
activin A receptor type II-lik
ESTs

selenophosphate synthetase 2
CGI-39 protein; cell death-reg
olfactory receptor, family 2,
Target Exon

potassium voltage-gated channe
C-type lectin-like receptor-1
C11001722*:gi|11436283|ref|XP_
a disintegrin and metalloprote
lactate dehydrogenase A
NM_019595:Homo sapiens interse
integrin, alpha 2b (platelet g
matrix metalloproteinase 12 (ma
C10000813*:gi|5453992|ref|NP_0



pkinase 

pklnase,Activin_Fecp 

MAPEG 

AIRS,AIRS_C 



SH3,elhand,C2,PH,RhoGEF 

FG-GAP,integiinJk 

Peplidase_M10,hemopexin 



E1-E2 ATPase,HMA,Hydrolas 
BRCT 
BIR,TK 
Kunilz_BPTI 



Na_H_E«;hanger,ABC2_niembr 
7lrrLl 

SNF2JJ,hericase_C,bromodo 
RYDR_ITPR,tenJrans,MIR 
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427672 
412048 
410079 






BE392921 Hs.3797 

AF005323 Hs.24040 

AA355B15 Hs.336916 

AW866B63 Hs.73090 

U94362 HS.S8S89 

AW406289 HS.96S93 

NM_001756 Hs.1305 



BE267696 Hs.254105 

AW499834 Hs.327 

NM_002960 Hs.2961 

AA251995 Hs.334648 



AA923729 Hs.26322 

D28419 Hs.82509 

AF017988 Hs.279555 

M77640 Hs.1757 

AF105202 Hs.241376 



fbrablasl growlli factor recep 
RAB26, member RAS oncogene fam 
C1001261*:gi|2695979|emb|CAA70 






432579 
422328 
445143 
450883 
414625 
4C1935 
418329 
426242 
400404 
442332 



404527 
439963 
412970 



401885 
402651 
457432 

420090 
425281 
410855 
407986 
431131 
422802 
447958 
438080 
418843 



AW247430 Hs.84152 

D13636 Hs.1 65237 
AF161221 

AI693251 Hs.8248 

AL137531 HS.258B90 



AW247529 Hs.6793 
AB026436 Hs.177534 
AIM)535 Hs.9573 



NM_016361 Hs.15871 
AW967280 Hs.293894 
X84048 HsJ017B0 



NM_005136 Hs,268538 

AB033002 Hs.21413 

AA220238 Hs,94986 

AA444390 Hs 155482 

X977S5 Hs,66718 

U32659 Hs.41724 

N84730 Hs.250616 

NM_004278 Hs.27008 

AW796524 Hs,6e644 

AA777381 Hs.291530 

AJ251016 Hs.89230 

AI436567 Hs.89761 

BE267033 Hs.1 92853 

U76362 Hs.1 04637 

M89957 Hs.89575 

AF207069 Hs.1 53357 



nuclear factor of kappa light
glycogenin 2
hypothetical protein
serine (or cysteine) proteinas
NM_001093*:Homo sapiens acetyl
C1S00C81O*:gi|11131272|sp|P793
enolase 1, (alpha)
interleukin 10 receptor, alpha
S100 calcium-binding protein A
poly(A) polymerase alpha
ESTs, Weakly similar to KIAA11
tyrosyl-tRNA synthetase



NM_003105*:Homo sapiens SOI
nitric oxide synthase 3 (endot
NM_017434:Homo sapiens dual ox
Target Exon
cell cycle related kinase
hydroxymethylbilane synthase
secreted frizzled-related prot
L1 cell adhesion molecule (hyd
potassium voltage-gated channe



gb:H.sapiens DAT1 gene, partla 



CPSase_lj;haln,bk)tinJip 



DUF101 

G-alpha,art 
WD40 

ldLrecepLa,fn3jdl_rece 

flavodoxin,FAD_binding,NO 

eftiand.Fertiojeduct 

PorpliobiLdeam 

Fz,NTR 

fn3,lg,IRK 

k)n_trans,KCNQ1_ohannel 

MCM 

PGAM 



hypctheUcal protein FU20442 
C4001170:gi|6863176|gb|AAF3040 
CI 10008g0:gl|3746443|gb|AAO639 
NbL02107S':Hamo sapiens NADH d 
junctional adtiesbn molecule 1 



death-associated protein Unas 
glutathione peroxidase 1 

Target Exon 

cystathionine- beta-synthase 
KIAA0010 gene product 
kallikrein14 
Target CAT 

Homo sapiens mRNA; cDNA DKFZp4 
C1002456*:gi|9930918|emb|CAC05 
peptide YY, 2 (seminalplasmin) 
platelet-activating factor ace 
dual specificity phosphatase 1
ATP-binding cassette, sub-fami
NM_004347:Homo sapiens caspase
NM_002210*:Homo sapiens integr
LPAP for lysophosphatidic acid
ESTs, Weakly similar to HERC2
frequenin (Drosophila) homolog
Target Exon

NM_000721*:Homo sapiens calciu



lily 12, (pot 
ribonuclease P (38kD)
hydroxyacyl glutathione hydrol
RAD54 (S.cerevisiae)-like
interleukin 17 (cytotoxic T-ly
isocitrate dehydrogenase 3 (NA
phosphatidylinositol glycan, c
Homo sapiens microsomal signal
ESTs, Weakly similar to ALUC.H
potassium intermediate/small c
ATP synthase, H transporting,
Target Exon

ubiquitin-conjugating enzyme E
solute carrier family 1 (gluta
CD79B antigen (immunoglobulin-



fer2,molybdoplerin,bac_dn 

GDA1_a)39 
PAF-AHJb,Upase_GOSL 
Rhodanese,DSPc 
ABC tran 

ICe320,ICEJ)10,CARD 
FG-GAP 



AL048842 Hs.1 94019 
AW246890 HS.6S42S 
AB004662 Hs.77867 
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400471 










8.8 


406591 






NM_003888*:Homosaptens reb'na 


aldedh 


8.8 


425427 


AI652662 


Hs.157205 


branched chain aminolransferas 




8.8 


410839 


NM 006849 


Hs.66581 


protein disulfide isomerase 


thiored,Rho_GI]l,gntR 


8.7 


430037 


BE409649 


Hs.227789 


mllagen-aclivated piolein Idna 


pkiiase 


a7 


450848 


AI677994 




frtis-ieiated tyrosine kinase 3 


fItSUig 


8,7 


414534 


BE257293 


HSJ5366 


BCL2-anlagonislof cell death 




8.7 








NM_014226*:Homo sapiens tenal 




8.7 


408493 


BE206354 


Hs.46039 


pliosphoglycerate mulase 2 (raus 


PGAM 






AI016521 


Hs.71316 


v-aki murine thymoma viral one 


hameobox,pkinase,PH,pkina 




430432 


AB037758 


Hs.241419 


KIAA1337 protein 


Patehed 




406128 






NM_002920':Homo sapiens regula 


OesLrecep,zf-C4,homione_ 




419493 


AF001212 


Hs.90744 


proleasome (prosome, macropain 


PCI 






AW602166 


Hs.222399 






86 


401134 






C12001198:gi|3183183|sp|Q92142 


biopterin_H 




442286 


W31847 


Hs.50335 


cytochrome P4S0 moncoxygenase 






428376 


AF119e65 


Hs.184011 




Pyrophosphatase 




433494 


AB029396 




beta-1,3^lucuronyllransferase 


GlycoJranf_43 




427001 


NM_0064a2 Hs.173135 


dual-specificity tyroslnefO- 






437278 


AA748017 


Hs.290145 


ESTs 






414463 


T59078 


Hs.76177 


alpha-l-mlcroglobulin/ljikunin 


lipocaIln,Kunit2 BPTI 




421871 




Hs.306122 


giycoprolein, synaptic 2 


Steroidjh 




447827 


U73727 


Hs.19718 


protein tyrosine phosphatase, 


Y_phDsphatase,fn3,ig,MAM 


ie 


403379 
446872 


X97058 


Hs.16362 


TargefExon 

pyrimldinergic leceptor P2Y, G 


0NAj)0U\ 
7tm_1 


8.6 


432857 


NM^016103 


Hs279582 


GTP-binding protein Sara 




a5 


420970 


AA30S079 


Hs.1342 


cytochrome c oxidase subunit V 


C0X5B 


as 


427221 


L15409 


Hs.174007 


TOR Hlppet-Undau syndrome 


VHL 


as 


402209 






TaigetExon 


A2tOI,A2M 


as 


400518 






C100020S7*:gl|321170S|gb|AAC21 




as 


425606 


U52112 


Hs.158331 


lenln-bindtng protein 




as 


437965 


AA843222 


Hs.193534 


E8Ts,lHbderalelysimilartoAL 


RasGEF 


as 


433392 


AF038535 


Hs.127588 


synaptotagminVli 


C2 


as 


402191 






NM.021733*:Homo sapiens testis 




as 


458963 


AI701393 


Hs,27a728 


Rad and Gem-reiated 2 (rat hom 




as 






Hs,271742 


AOP-rlbosyltransferase (NAD; p 


PARP,PARP_reg 


as 


457579 


AB030816 


Hs,36761 


HRAS-IIke suppressor 




as 


409656 


NM 005133 


Hs.2aa626 


RCE1, prenyl protein protease 


Abi 


as 


456373 


BE247706 


Hs,89751 


membrane-spanning 4-domains, s 




8.4 


432499 


BE276633 




RAB6B, member RAS oncogene tam 


ras,ar1 


8.4 


400565 






Target Exon 




401960 






Target Exon 


Branch 


8,3 


432545 


X52486 


HS.3C41 


uracil-DIWglycosylase2 




8,3 


445303 


AW352198 


Hs.12503 


Interieukin 15 receptor, alpha 






404528 






peptide YY, 2 (seminalplasmin) 


GDA1_CD39 


8.3 


428542 


D79989 




KIAA0167 gene product 


ank,PH,AcfGap,ras 


6.3 


406868 


AA505445 


Hs,300697 


immunoglobulin heavy constant 




8.3 


405473 






NM_001093*:Homo sapiens arstyl 


CPSaseLfjchan,bioBnJip 


8.3 


408601 


U47928 


Hs.86122 


protein A 


7ttn_1 


a3 


415008 


NM_002777 Hs.928 


proteinase 3 (serine proteinas 


\ijpsm 


8,3 


430258 




Hs,236963 


protein phosphatase 2A, regula 




a3 


436483 


AJ272063 


Hs.283010 


vanilloid receptor subtype 1 




8,3 


459302 


NM.002314 


H5.36S66 






8,3 


437644 


AA748575 


Hs.136748 


leclin-lilieNK cell receptor 


lectln_c 


8.3 


421707 


NII1.014921 


HS.1070S4 


lectomedin-2 


Latrophilin,OLF,7tiiu2,Gal 


a2 


414629 


AA34S824 


Hs.76688 


caitioxyleslerase 1 (monocyle/iii 


COesterase 


B2 


453898 


AW0O3512 


Hs.232770 


arachidonate lipoxygenase 3 




a2 


424053 


AF057C36 


Hs.138520 


cattagen-tke tail subunit (si 




a2 


457398 


BE258532 


Hs.251871 


CTP synthase 


GATase 


8,2 


421604 


AW402997 


Hs.105062 


adaptor protein with pleckstii 


SH2,PH 




40S495 






TargefExon 


SRCR 




453610 


AW368882 




RecQptoteln-likeS 


DEAD,heIlcase_C 




424880. 


NM 000328 


Hs!l53614 


rellnilis pigmentosa GTPase re 


RCC1 




423847 


U16997 


Hs.133314 


RAR-relaled orphan receptor C 


honiione_rec,zf-C:4 




409829 


M33552 


Hs.56729 


lymphocyte-specific protein 1 


Caldesraon 




401180 






eukaryotic translation elongat 


ion_trans,IQ 




452072 


BE25885r 


Hs.27744 


RAB3A, member RAS oncogene fam






426434 


AA379658 


Hs.272759 


KIAA1457 protein 


IP_trans 




402453 






C1002496;gi|7363439|retlNP_039 


7lnn.1 




457310 


W28363 


Hs.239752 


nuclear receptor subfamily 2, 






422069 


AJ010063 


Hs.343603 


titin-cap (telethonin)


gtobln,cNMP_binding,pkina 




400275 






NM_006513':Hoiiio sapiens seryl- 


NA 




434357 


AW732284 




mevalonate (diphospho) decarbo


GHMP_kinases 




430299 


W28673 


Hs!l06747 


serine carboxypeptidase 1 preo 








AW411479 


Hs.348 


FK506-binding protein 4 (59kD) 
ENSP00000085284*:CDNA FU20404 


FKBP,TPR 




402393 






RhoGEF,PH 




429252 


NM 00465e 


Hs.198312 


RAS protein activator like 1 (


C2,PH,RasGAP,BTK 




456181 


1^6463 


Hs.1030 


ras inhibitor 


RA.SHZVPS9 




431493 


AI791493 


Hs.129873 


ESTs, novel cytochrome P450


p450 




4S1S5B 


NM-0010B9 Hs.26630 


ATP-binding cassette, sub-fami


ABC_tran,SRP54 




415758 


BE270465 


Hs.78793 


protein kinase C, zeta


pkinase,DA(LPE-b-nd.pkina 




419270 


NM.Q0S232 Hs.8983g 


EphA1


EPHJbd,pklnase,SAM,fh3 




422837 


U25441 


Hs.121478 


dopamine receptor D3 


TlmJ 




401118 






Target Exon
426440
418635 
432747 

403672 
437806 



425423 
402211 
408710 
457615 
402760 
425428 
423579 



424774 
402632 
444169 
405714 
442732 
421768 
415995 
406137 



446143 
422201 
440869 



426268 
445087 
416377 
421748 



411898 
427010 
457306 
431630 
457764 



428761 
430396 
422066 
445937 



456243 
424321 
405187 



449546 
428926 
404953 
449401 
429962 
421547 



AI424921 Hs.122487 

U48213 Hs.155402 

AB014606 Hs.139648 

AW780199 Hs.30327 

AI571225 Hs,284171 

AB029010 Hs.143026 

NM_005897 Hs,157180 



AF116846 Hs.10431 

AA257161 Hs.8658 

BE397336 Hs,1422 
UUJMS13 



U18548 Hs.123034 

NM_005133 Hs.288626 

AK00iai2 Hs.7036 

U37689 Hs.3128 

BE245342 Hs.306C79 

NM_001505 HS.1132C7 

NM_0142g7 Hs,7486 



^s.4756 



AA179930 Hs.293867 

NMJ14718 HS.1078C9 

NM_006201 Hs.171834 

AB033078 Hs.186613 

BE40g714 H5.44856 
AW138332 

BE268048 Hs.2364S4 

NM.002204 Hs.265829 

AVW)28284 Hs,4815 

AF213457 Hs.44234 

AW015098 Hs.301946 

AF236119 Hs.1 93076 

D49742 Hs.241363 

AW249275 Hs.343521 

AI452943 Hs.321231 

AA953015 Hs.274370 

D46371 Hs.80485 

NM_002462 Hs.76391 

AL03607) Hs.279899 

W32474 Hs.301746 

AF038660 Hs.206713 



AV655701 Hs.75183 

BE379077 Hs.130849 

AU077005 Hs.92208 

Z33905 Hs.81218 

AK002011 Hs.37558 

NM_005109 Hs.95220 

W85248 Hs.58819 

NM_001702 Hs.1 94654 

AL1 35401 Hs.23557 

M69113 Hs.226796 

AA489908 Hs.1390 

NM.003463 Hs.227777 

AA157867 Hs.182265 



dual specificity phosphatase 2
calcium channel, voltage-depen
C4001244gi|539933|pir||A61275
ESTs, Weakly similar to A54854
D site of albumin promoter (al
kinesin family member 1C
mitogen-activated protein kina
KIAA1535 protein

Target Exon

intracisternal A particle-prom
KIAA0430 gene product
mitogen-activated protein kina
calcium/calmodulin-dependent p
NM_021797*:Homo sapiens eosino
DKFZP586B0621 protein 



TATA box binding protein (TBP) 
Target Exon 

dead ringer (Drosophila)-like



Glyco_llyd^oJ8,CB^4.14 

Clq.Collagen 

G_glu_transpepl 



Target Exon 

C1001261':gi|2695979|emb|CAA70 

keratin 17 
KIAA1118 protein
Target CAT

G protein-coupled receptor 12
RCE1, prenyl protein protease
N-acetylglucosamine kinase
polymerase (RNA) II (DNA direc
sec61 homolog
G protein-coupled receptor 30
protein expressed in thyroid
flap structure-specific endonu
ESTs

a disintegrin-like and metallo
serine/threonine kinase 24 (ST
suppressor of Ty (S.cerevisiae
caspase recruitment domain pro
KIAA0726 gene product
PCTAIRE protein kinase 1
sphingosine-1-phosphate lyase
hypothetical protein FLJ12116
muscle RAS oncogene homolog
RAB10, member RAS oncogene fam
Integrin, alpha 3 (antigen CD4
nudix (nucleoside diphosphate
triggering receptor expressed
ESTs, Weakly similar to T30B57
GRB2-related adaptor protein 2
hyaluronan-binding protein 2
malate dehydrogenase 2, NAD (m
UDP-Gal:betaGlcNAc beta 1,4- g
hypothetical protein FLJ20260
NM_003105*:Homo sapiens sortil
adipose most abundant gene tra
myxovirus (influenza) resistan
tumor necrosis factor receptor
RAP2A, member of RAS oncogene
UDP-Gal:betaGlcNAc beta 1,4- g
Target Exon

menage a trois 1 (CAK assembly
lymphocyte-specific protein ty
NM_014272:Homo sapiens a disin
cytochrome P450, subfamily IIE
ESTs, Weakly similar to 138022

hypolS pr^erMG^7
hypothetical protein FLJ11149



Fz,kringle,ig 
ARID,SNF 
7lm_1 

EGF,lamlnin^GF,Xllnk,S_m 

8H2,8H3,pklnase 

PI-PLC-XPI-PLC-Y,C2 



7tmJ 

laclaraase.B 

XPG.N,XPGJ5.3.exonuclea 

tsp.l ,Reprolysin,Pep_M1 2B 

$1,SH2,Ribosoraal_L23,pkin 

cadherin 

pyridoxaLdeC 



inlegiin_A,FG-GAP,Rhabd_g 



SH2,SH3 

lrypsin,kringle,EGF 

ldh,ldh_C,adh_short,Semla 

GalaotosyLT_2 

PH 

ldl_recept_3,fn3,ldl_rece 
C1q,Collagen 
dynamh_2,dynamin,GED 
TNFR_c6 

Gal'aclosyl_T_2,ig 

Giyco_hydroJ 

zf-C3HC4 

SH2,SH3,pkinBse 

ReproIysin,tsp_1,Pep_M12B 



disinlegrin,Reprolysln,Pe 
TPR,zf-C3HC:4,PHD 
FAD Synth 



404757 
406370 
443611 
424008 
444912 



451394 
436523 
422714 
408924 
414551 
413254 
415010 

432221 



417985 
447507 
418322 



426176 
421395 
430517 



402115 
407601 
404679 



453190 
439975 
412800 



453082 
421168 
422287 



407338 
448426 
409686 
450778 



415701 
423158 
414788 
412915 



AA092510 Hs.5985 

n4588 Hs.8509 

U94905 Hs.277445 

T50872 Hs.2001 

AA054400 Hs.151706 

NM_002342 Hs.1116 

AIS73076 Hs!231958 

X51615 Hs.2798 

AA436959 Hs.253802 



NM_014397 Hs.9625 

R02740 Hs.137555 

AW247380 Hs.12124 

X66945 Hs.743 

NM_002447 Hs.2942 

AI359476 Hs.1 57699 

AK001713 Hs.17860 



NM_013385 Hs.7189 

L36149 HS.24B116 

NM_003595 Hs.26350 

BE612990 HsSZn 

AB018335 Hs.119387 

AW295606 Hs.236131 

A1815639 Hs.76394 

U40272 Hs.75253 

NM_004203 Hs.77733 

AB009698 Hs.23965 

M21191 H3.273415 



090084 Hs.1023 

S80071 Hs.241597 

AI686379 Hs.1 10796 

BE382411 Hs.3764 

AC002300 HS37129 

AI732707 

AB040949 Hs.6733 

BE410243 Hs,78769 

AB002354 Hs,32312 

AW328081 Hs.6817 

AW950852 Hs.74598 

X94630 Hs.3107 
T74640 

BE549343 Hs.82208 

H18444 Hs.134846 

AF235097 Hs.227583 

AJ00B422 Hs.135183 



NM_005781 Hs,153937 

H18835 Hs.31608 

AF182277 Hs,330780 

F16365 Hs.1 14346 

AA648602 

AF273304 Hs.235376 
AA773213 

BE01831S Hs.280776 

AK000002 Hs.55879 

U81375 H5.254S0 

NM.002067 Hs.1686 

AF024690 Hs.248056 

AF051941 Hs.343824 

BE391579 Hs.75087 

NM_003878 Hs.78619 

H97991 Hs.193313 

X78342 Hs.77313 

AVW)87727 Hs.74823 



Target Exon

non-kinase Cdc42 effector prot 
ESTs, Weakly similar to C3HU c
diacylglycerol kinase, zeta (1
thromboxane A synthase 1 (plat 
KIAA0134 gene product 
lymphotoxin beta receptor (TNF 
Targets- 



leukemia inhibitory factor rec 
ATPase, Na+/K+ transporting, b
Target Exon 
interieukinll 



putative prostate cancer susce 
fibroblast growth factor recep
macrophage stimulating 1 recep 
ESTs 

Target Exon 

hypothetical protein FLJ10851
phosphodiesterase 4A, cAMP-spe
C1100Q129:gi|9938014|ref|NP_06
pleckstrin homology, Sec7 and
chemokine (C motif) XC recepto
tyrosylprotein sulfotransferas
single-strand selective monofu
KIAA0792 gene product
homeodomain-interacting protei
enoyl Coenzyme A hydratase, sh
isocitrate dehydrogenase 3 (NA
membrane-associated tyrosine-
solute carrier family 22 (orga
aldolase A, fructose-bisphosph
G protein-coupled receptor kin
ESTs

crystallin, zeta (quinone redu
POP7 (processing of precursor,
cyclin-dependent kinase inhibi
CDC10 (cell division cycle 10,



farnesyl-diphosphate farnesylt
SH3-domain binding protein 2
pyruvate dehydrogenase (lipoam
solute carrier family 6 (neuro
SAR1 protein

C1001899*:gi|12722636|ref|XP_0
N»mi624^o sapiens histaml 



IKmeloligopeptidase 
KIAA0356gene prod 
inosinelnphosphalas 
polymerase (DNA dir 



!in3 
centaurin-alpha 

gb:yy65g02.r1 Soares_muttiple_ 
Homo sapiens, Similar to trans 
activated p21cdc42Hs kinase 
hypothetical protein FU20041 
cytochrome P450, subfamily IIB 



Na_K-ATPase 



C15000492*:gi|3127193|gb|AAD05 
ESTs 

XPMC2 protein 



lg,pklnase 

pklnase,SeiirafSI,TIG^4^ 
Cache 



DUF221 

ECH,Peptldase_U7 



Yjihosphatase,DSPc 
GTP.CDCM 

SQS_PSY,dsrm,z-alpha 
PH,SH2 



E1_de 



C2,PI-PLC-Y,P|.PLC-X,RasG 

Peptidase M3 

PH,PHD,RUN 

Hamlpjke 

homeobox 

7tnrL2,GPS,EGF 

A2II4.N,A2M 

Acy|.«oA.dh,Acyl-Co^.dh_M 



. I66f10.s1 

tankyrase, TRF1-interacting an

Homo sapiens mRNA; cDNA DKFZp4 ABC_tran

solute carrier family 29 (nucl Nucleoside_tran

guanine nucleotide binding pro G-alpha,arf

G protein-coupled receptor 43 7tm_1

nucleoside diphosphate kinase NDK

Fas-activated serine/threonine

gamma-glutamyl hydrolase (conj GATase

Target CAT Moa\.NifB_PqqE 

cyclin-dependent kinase (CDC2- pkinase 

NM.004&41:Homo sapiens NADH de 
















415503 






N-deacetylase/N-sulfotransfera 






433074 


AIj045019 




Homo sapiens cDNA FLJ11214 fis


DEAD,hellcase_C,dsmi,Vira 




409124 


AW292809 


Hs.50727 


N-acetylglucosaminidase, alpha






428270 


BE601549 


Hs.107040 








435114 


AA775483 














Hs'l867 


progastricsin (pepsinogen C)






463064 


AI87B90B 


Hs.31547 


Target CAT 








NM 002691 












AF120499 


Hs.78016 


polynucleotide kinase 3'-phosp


Viral.heOcasel 




400985 






Target Exon 








Y00815 


Hs.75216 














acid phosphatase 2, lysosomal


acidj}h(^phat 






AI4T6988 


Hs.238272 


inositol 1,4,5-triphosphate re


ion_frans,RYDIl.lTPR,MIR 










CI OOOI991*:gi|6624920|einb|CAB6 












phosphoglycerate dehydrogenase


2-Haoi(LDH,2-Hacl(LPH_C,M 




409631 


BE3e4319 


Hs,54702 


xylosylprx^teh beta1)4-9aIacto 


GalaotosyLTJZ 


6.7 


449139 


BE268315 


Hs.23111 


phenylalanine-tRNA synthetase-




5.7 


460207 


T87615 


Hs,14716 


ESTs 




5.7 


400266 








ABC Iran 


6.6 


430713 


AA351647 




eukaryotic translation elongat 


GTP EFnJ,GTP_EFlU D3,GTP 


5.6 


420911 


U77413 


Hs.l 00293 


O-linked N-acetylglucosamine (


TPR 


6.6 


405683 






Target Exon 




6.6 


449181 


X96783 


Hs.23179 


synaptotagmin V


02 


5.6 


414467 


AW514320 


Hs,7616g 


ATPase, H transporting, lysoso


ATP-synLC 


6.6 


415193 


ALj048891 


Hs,1218S 


hypothetical protein MGC14333




5.6 




AVV381638 




hypothetical protein MGC12969






433135 


AA443e73 


Hs!l1C477 


dolichyl-phosphate mannosyltra




is 


413049 


NML002151 


Hs.823 


hepsin (transmembrane protease






420899 


NM.001629 


Hs.100194 


arachidonate 5-lipoxygenase-ac 


M.APEG 


is 




NM 001838 




chemokine (C-C motif) receptor








BEf90832 


Hs!l34729 


FXYD domain-containing ion tra 








NM.000885 


Hs.40034 


integrin, alpha 4 (antigen CD4


lntegrin_A,FG-GAP 




405594 






NM_021949:Homo sapiens ATPase, 


EI-E2JMPase.Hydrolase 






BE019494 






P5CR,Ootopine_DH_N 






AW451473 


Hs,16134 




pkinase.TYA 




411030 


BE387193 


Hs,67896 


7-60 protein 




5.5 


431498 




Hs,268551 


aspartyl aminopeptidase


Peptidase.Mie 


6.5 


433012 


NM_004045 


Hs,279910 


ATX1 (antioxidant protein 1. y 








X90725 




polo (Drosophila)-like kinase






424672 


Mig650 


Hs.92909 


2',3'-cyclic nucleotide 3' pho




6.5 


406617 






Target Exon 


eihand,Ferric_reducl 


6.5 










trefbil,Glyco_hydro_31 




419525 


T79257 










448093 


AW977382 


Hs,15898 


2.4-dienoyl CoA reductase 2, p 




5!4 




BE242842 


Hs,6780 




Suifbtransfer 




406432 






CD1E antigen, e polypeptide 










polymerase (RNA) II (DNA direc






430337 


M36707 


Hs!23g600 


calmodulin-like 3


efhand 


6.4 


427162 


AB011133 


Hs.173864 


KIAA0661 protein 


pk1nase,PDZ 


6.4 


414216 


D86970 


HS75822 


TGFB1-induced anti-apoptotic f


oxldore^q4,myosii\.head,b 


5.4 


422083 


NM 001141 


Hs,111266 


arachidonate 15-lipoxygenase,


lipoxygenase.PLAT 


6,4 


424373 


AJ133798 


Hs,146219 


copine VII


02 


5.4 


4494CS 


AA0013S0 




gb2h83h0Sx1 Soares_fet:Lliv 




5,4 


409983 


□60922 


Hs.67729 


Kelch-like ECH-associated prot


BTBTKefch 


6,4 


455818 


AI733747 


Hs.71174 


interleukin 21 receptor




5.4 


424357 


AW961058 


H5.44856 


hypothetical protein FLJ12116




5,4 


423606 


AB011094 


Hs,129e92 


KIAA0622 protein 


PH,bZiP,ia8ec7 


6.3 


432311 


BE083080 


Hs,274323 


similar to sialyltransferase 7


Glyco.transf_29 


5.3 


450080 


AB037831 


Hs,24372 


ESTs, Weakly similar to dJ207H 


deadTgspile 


6.3 




Y09267 


Hs.132821 




FMO-Iike,pyr_redox 


















BE262621 


Hs73798 






6.3 




BE277414 












BE388866 




proteasome (prosome, macropain


PA2B_aIptia,PA2B_beta 






NM_016011 




CGI-63 protein








X98507 












BE301871 






HLH,Myc_N_ierm,Myc-lZ 




432336 


NM_002769 






dsrm.pkinase 


6!2 
















AB012722 


Hs,198266 


kinesin-like 3 






432462 


AK000013 


Hs.274701 


thymidine kinase 2, mitochondr 


dNK 


6.2 




AI739312 


Hs.284163 


ANKHZN protein 






406697 








6.2 


450321 


Y16621 


Hs.24812 


CDP-diacylglycerol synthase (p


Crtidyiyitrans,Adeno_VII 


6.1 


412939 


AW411491 


Hs.75069 


eukaryotic translation elongat


SHMT 


5.1 


446109 


AF039916 


Hs.12330 


ectonucleoside triphosphate di


GDA1 OD39 


6.1 


419073 


AW372170 


Hs.183918 


Homo sapiens cDNA FLJ12797 fis


ig,bpJ,ZU5 


6.1 


409958 


NM.001523 


Hs.57697 


hyaluronan synthase 1 


aycos_lransf_2 


5.1 


442699 


AF078037 


Hs.324061 




SH3,ank 


6.1 


424305 


BE386096 


Hs.112272 


histonedeacetylaseS 


HisLdeacetyl 


5.1 


427247 


AVI(504221 


Hs.174103 


integrin, alpha L (antigen CD1


TOa,inlegrin_A,F&GAP 


5.1 


429061 


Y14039 


Hs.196176 


CASP8 and FADD-like apoptosis


DED,IOE_p20 


6.1 


420849 


X62221 


Hs.99987 . 


excision repair cross-compleme


5.1 



WO 02/102235



PCT/US02/19297 



418910 
425771 
405202 
451452 
418231 
425165 
407876 
417831 
404716 
405020 
426236 
433178 
422340 
439414 
425846 

424168 
436042 
410775 
423734 
420340 
433075 
400300 
426811 
401577 



421487 
402183 

456748 
424771 
405441 
437053 
443044 
431204 
456417 
436735 

446948 
451564 
403771 
403248 
410214 
407047 



420028 
402912 
443329 
426120 
430509 



R73417 Hs.25391 

Z25821 Hs.89466 

BE561776 Hs.169494 

BE560065 Hs.26433 

AA326896 Hs.83848 

NM_014434 Hs,1 54899 

m 004519 Hs.40866 

HI 6423 Hs,82686 



NM_001183 H3.6551 

AA102174 Hs,159629 

AJ006239 Hs.75438 

L29277 Hs.321677 

AF284422 Hs.119178 

AB0W60 Hs.66196 

BE303044 Hs.192023 



BE259228 Hs.172609 

AA323948 Hs.55407 

BE296216 Hs.172673 

U6S402 Hs.248124 

AF107028 Hs.129783 

AF027406 Hs.104865 



AU077018 Hs.3235 

N28622 Hs.8935 

F28841 Hs.250760 

L36531 Hs.91296 



AU076698 Hs.13276C 



L29555 Hs.301698 
X65965 

AF199364 Hs.119120 

AJ004832 HS.S038 

NM.006672 Hs.251395 

Y13647 Hs.119597 

U66S79 HS.1888S9 

A1762964 HS.20518C 

L00727 Hs.898 

839329 Hs.181350 

D42046 Hs.194665 

AB014680 HS.B786 



Hs.9234 



AL049654 Hs.180871 



Target CAT 
potassium voltage-gated channe
CD47 antigen (Rh-related antig
NM_007313*:Homo sapiens v-abl
Target Exon

kinesin family member 3B
cysteinyl leukotriene CysLT2 r
RAB7, member RAS oncogene fam
ATPase, H transporting, lysoso
myosin IXB

quinoid dihydropteridine reduc
signal transducer and activate
cation-chloride cotransporter-
nth (E.coli endonuclease III)-
eukaryotic translation initiat
CD3Z antigen, zeta polypeptide 



ion_trans,KCNQ1_channel 



myosin_head,DAG_PE-blnd,l 
adh_short 

SHiSTAT,STAT_bmd,STAT_p 



Target Exon 
keratin 4 

quinolinate phosphoribosyltran
cytochrome c oxidase subunit V
Integrin, alphas
mannosyl (beta-1,4-)-glycoprot
zinc/iron regulated transporte
peroxisomal long-chain acyl-co
hypothetical protein MGC15729
NM003061:Homa
ESTs, Weakly similar to 17888
sialyltransferase 4A (beta-gal



slit(D 



E3ublqultlnligaseSMURF1 



solute carrier family 22 (orga
stearoyl-CoA desaturase (delta
G protein-coupled receptor 20
ESTs

dystrophia myotonica-protein k
kallikrein 2, prostatic
DNA2 (DNA replication h
carbohydrate (f
Target Exon
hypothetical protein MGC1 936



UvrIMiellcase,ViraLhellc 



4(X)846 
456881 
418172 



AW385129 Hs.41717 
BE256238 Hs.193163 
AAB84517 Hs.31856 



Hs.120866 
Hs.279915 
Hs.258609 



AW162931 Hs.45002 
AL110209 Hs.6770 
AW057736 HsJ23910 . 



phosphodiesterase 1A, calmodul



Hermansky-Pudlak syndrome
cyclin-dependent kinase inhibi 
KIAA1283 protein 
translocase of inner mitochond 
protein tyrosine phosphatase, 
sortilin-related receptor, L(D
protein phosphatase 2, regulat 



ldUeceptja,fn3,ld|jece 
B56 

pkinase,PH,RGS 






41297-1 
414702 
425796 
422454 
408135 



417891 
421661 
426516 



416297 
421962 
415341 
456668 



429362 
429133 
426079 
414814 
433261 
402915 
418267 
430716 
420874 
439902 



448191 
456217 
436415 
429218 
407433 



421771 
448B36 
4t4a21 
431096 



R18978 

L22005 

AJ000479 

U49070 

AA317248 



NM_004714 Hs.1 30988 

AB018263 Hs.180338 

NM_016154 Hs.279771 

ALJ046341 HS.7SSG2 



WB1526 Hs.1 18329 

AW327546 Hs.1 11024 

AA317089 Hs.697 

AW249394 Hs.5002 

AW247615 Hs.37003 



N31854 Hs.197116 

D31220 Hs.166168 

D14697 Hs.77393 

AB040967 Hs.1 12034 

BE389637 Hs.83919 

BE387257 Hs.247831 

X66357 Hs.336478 

AF174499 Hs.6764 

NM.004405 Hs.419 

106419 Hs.75093 

AF210652 Hs.16614 

X92762 Hs.79021 

Aig51628 Hs.127007 

NM.00S881 Hs.20644 

BE253181 Hs.81687 

BE265254 Hs.343258 

AA225065 Hs.198269 



T96509 Hs.248549 

AA157857 Hs.182265 

NM_C05518 Hs.59889 

R51573 Hs.79300 

NM_C14821 Hs.20126 

NM_CC1224 Hs.108131 



AL037111 Hs,75S41 
X75208 Hs.2913 
AF014643 Hs.1 00072 



A1124572 



Hs.22735 
HsJ23879 
AU)37B95 Hs.279861 
BE27BB91 Hs.194691 
AA3G6610 HSJ48183 
AK001663 Hs.135458 
U96132 Hs.171280 
AF131836 Hs.170453 
AJ224M2 Hs.155020 
NM_000947 Hs.74519 
R86701 



225470 Hs.248145 . 



hypothetical protein FLJ10038
solute carrier family 18 (vesi
phosphate cytidylyltransferase
emopamil-binding protein (ster
cell division cycle 34
endothelial differentiation, G
protein (peptidyl-prolyl cis/t



cadherin-like 22
signal sequence receptor, delt
1-acylglycerol-3-phosphate O-a
inositol polyphosphate phospha
NM_014624:Homo sapiens S100 ca
protein phosphatase 1, regulat
CASP8 and FADD-like apoptosis



Homo sapiens clone PP1 596 unkn 



Melhyltransf_4 

cadherin,Cadherin_CJemi 

Herpes_UU 



solute carrier family 25 (mito
FabG (beta-ketoacyl-[acyl-carr
gb:ye74o04.r1 Soares fetal liv
ESTs, Moderately similar to GA
solute carrier family 25 (mito



Mr chaperone for superoxid 
v-Ha-ras Harvey rat sarcoma vi 
solute carrier family 7 (catio

ubiquitin-conjugating enzyme E
solute carrier family 7 (catio
peter pan (Drosophila) homolog
farnesyl diphosphate synthase
KIAA1534 protein
ENSP00000202587*:Bicarbonate t
glucosidase I

Homo sapiens, Similar to myosi



histone deacetylase 6



procollagen-lysine, 2-oxogluta
5(3)-deoxyribonucleotidase (dN
tafazzin (cardiomyopathy, dila



Target CAT 

gb:Homo sapiens orphan G-prote
ESTs, Moderately similar to S6
keratin 19

3-hydroxy-3-methylglutaryl-Coe



KIAA0317 gene product
caspase 2, apoptosis-related c
hypothetical protein FLJ10357
Fc fragment of IgG, high affin
Homo sapiens DNA, cosmid clone
myomesin 1 (skelemin)(185kD)
sigma receptor (SR31747 bindin
IMP (inosine monophosphate) de
NM_005546*:Homo sapiens IL2-in
hypothetical protein FLJ20277
galactose-1-phosphate uridylyl
EphB3 

connexin46.6 



retinoic acid Induced 3 



TNFR_c6,death,PH,Xnnk,Rh 

F5_F8_type_C,pkinase 
ABC_membrane,ABC_tran 



Neur_chan_I.BD,NeuiLshanjn 

aminotran_1_2 
sodcu,HMA 

aa_permeases 

dealh,TNFR_c6 

UCLcon 

aajemieases 

7im.i 

polypcenyljsynt 
PH,OxysteroLBP 
HC03_cotransp 
Glyco_hydrQ_63 



HATPaseLP 

NDK,Artsrijlycap 

PeptidaseJU4 



HMG CoA^ynt 

UQ_con 

Rlanln,HECT 

ICej)20,CARD,ICEj)10 

PH,RhoGEF 



Neur_chan_LBD,Neur_chan_ra 



primase, polypeptide 2A (58kD) 
gb:ym86d09.r1 Scares adult bra 
NM_030807:Homo sapiens glucose 
Target Exon 






430237 A1272144 Hs.235522 

426175 AF020202 Hs.155001 

409067 aE260459 Hs.50267 

419982 AA252544 Hs.55510 

428394 AU076472 Hs.1 84141 

437696 Z83344 Hs.5790 

464C34 NM_000691 Hs.575 

410237 AI750689 Hs.61258 

461478 NM_012331 Hs.2645B 

415410 AF037332 Hs.273569 



424349 
421318 



419726 

423814 
461365 

428284 
431968 
443639 



442549 
427239 
461126 
426081 
402171 



420148 U34227 

412167 U68487 

412666 AF006011 

425786 .U3S234 

424288 AW137198 

452230 AW135360 

408449 NM_004408 

423883 AF260238 

422676 D28481 

458639 BE247683 
400726 
405370 

413654 AA331881 

432917 NM_014125 

448362 AA641767 

424512 X53002 



AF1412B9 Hs.145550 

AB023180 Hs.7724 

U63973 Hs.1 03501 

ABOSaSe Hs.6535 

N73703 Hs.293267 

AA010233 HS55921 

NM_0142S6 Hs.69009 

AU076916 HsJSm 

BE410590 HS.1192S7 

BE272922 Hs.173936 

D31B33 Hs.1372 

U47927 Hs,3759 

BE268649 Hs.177766 

AI929320 Hs.75516 

BE274009 Hs.772 

U50330 Hs.1 274 

AF105020 Hs.132989 

NM_004197 Hs.444 

NM_006245 Hs.1 18244 

AA535762 Hs.183435 

AF1 17222 Hs.272261 

BE269042 Hs,9661 

AF207989 Hs.58014 

AF026273 Hs.249175 

Z21093 Hs.30352 

AI751601 Hs.8375 
BE270447 

AA015779 Hs.226923 

X74794 Hs.154443 



Hs.73739 
Hs.74375 
Hs.159534 
Hs.278682 
Hs.224170 



Hs.75454 
HS.241S17 
Hs.21015 
Hs.149846 



Target Exon 
DKFZP434P106 protein 
UNC13 (C. elegans)-like
putative GTP-binding proteins
solute carrier family 30 (zinc
glutaryl-Coenzyme A dehydrogen
hypothetical protein dJ37E16.5
aldehyde dehydrogenase 3 famil
argininosuccinate lyase
methionine sulfoxide reductase 
sorting nexin 17 
Target Exon 

solute carrier family 7 (catio 
KIAA0963 protein 
rhodopsin kinase
brain-specific Na-dependent in 



PepJ<12B_pto|)ep 

abhydrolase 

DAG_PE-blnd,C2 



inler1eukin10re( 



ubiquitin specific protease 5
ADP-ribosyltransferase (NAD; p
tyrosine kinase 2
glycogen synthase 1 (muscle)
bone morphogenetic protein 1
putative protein O-mannosyltra
serine/threonine kinase 19
protein phosphatase 2, regulat
NM_004545:Homo sapiens NADH de
UDP-Gal:betaGlcNAc beta 1,3-ga



Homo sapiens, Similar to G pro 
interleukin-1 receptor-associa
ribosomal protein S6 kinase, 5
NM_007037*:Homo sapiens a disi
TNF receptor-associated factor
ubiquitin carrier protein
ESTs 



Target Exon 

myosin VIIA (Usher syndrome 1B
5-hydroxytryptamine (serotonin
dishevelled 1 (homologous to D



pk:na3e,pkinase_C,RGS 

sjgar_lf,BT1 

GIyoo_hydro_31 

WHEP-TRS,GST_C,HGTP_antii: 
SNF2_N,helicase_C,zraHC 
GalactosyLT 
. GMP3ynLC,GATase 
SH3,HSUep 
Tlssuejac 
7lin.1 



Reprolysin,tspJ ,Pep_U 1 2B 
zf-aHC4,MATH,zf-TRAF 

Yjhosphalase 



fflyosin_head,IQ,MyTH4,SH3, 
7tltLl 

PDZ.DEP,DIX,Dfehevelled 
fi)3,ig,Y_phospbatase,DSPc 




413964 1809680. 



27436J 



CAT Number Accession 

1 1 824J AI609645 AI818201 AA948024 AI278970 AA688086 AA858279 F21973 W95840 AW969644 X99726 AA431 579 AA970887 AI885085 AI767835 
BE666516 AA725824 A1000871 AW242322 AW007204 W6828g AA431450 AW466973 BE222544 AA483454 AI968050 W95975 AI381017 
AA776726AI040976AA89 

1534442.1 R0C602Z42921 F06132 

1664.1 igM.004573 M95678 BE242666 AW6041 1 0 AVMOB049 AW402206 AA774879 AW630959 AI439623 AI933994 AW7512B2 AW37441 3 M578823 
Hie054AA310466 F12S78n4300 M3S3176 AW950138AW9S0600 AA912021 AIS24064 AWiB3098 AI416g66AW769231 AI7S7111 AA293723 
AI4222g0AA46S038A 
T74640T74649 

BE257143 BE3121 11 AF2igi37 NM.015720 BB136S8 BE382652 BE252205 BE2S1SS3 F12128T662QB BE255806 BE2544B4 AA324163 H07962 
AL1341 64 AIB67802 AI2049n AI282924 AW1 92547 AI652760 AI266471 AI0B3778 

AW138332 AW207450 AW138931 AW1369B3 NM.012219 AF043938 AA931386 AI0B4600 AA975999 BE551 105 AA450260 AI080368 AA324154 
AF022080 BE009901 AL1 18847 W44458 AI765270 AA453121 AI14B638 AI373696 AA3241 53 BE174809 AA35076S 
BE270447 AW409921 BE207288 BE207170 D56355 BE263223 BE408171 BE262243 BE392439 BE292738 BE261776 BE314300 BE26771 9 
BE268715 BE513876 BE295291 BE297066 AA210923 BE407519 H51344 BE622905 AW248281 AW250313 T19021 AA3551 15 AA3 16879 
BE26g633 BE621936 AA290724 
AI287878AI8C4160AA400787 



27647_1 



277229.1 









29266.1 
29737_1 



3679_1 

392764J 



489536_1 
8«917_1 



983107_1 



D79989 NM_014770 U81 031 AA352392 M984512 H38328 AL1 20358 AL134787 AL134589 AI637763 AI671 506 AA526909 AI651627 AW243560 



BE276633 MM 016577 AF16H92 BE276152 AF091031 AA9086C7 U66623 AI570393 AA682567 AW593957 AI148105 AW002431 AI637463 
AI767195 AA339439 R13005 R23431 AW961068 AA233819 AA224118 R19618 A1890314 Z46184 

NrL002959X98248AA233278AA846376AW70560AI47C533BE327147AW291971AA017125AI198417Al365213AI168442Al337018 
A1476049 H86459 AA969896 AASSSOOO AM18326 AA418378 N71981 AL043634AA426361 AA418275 AA232976 AL036861 BE277220 BE387506 
N99710AW375004A 

AB029396 T04934 R21715 R19005 H1 1563 H14256 R46605 Z40857 BE218899 AI457785 BE550988 AI693847 AA961017 H40944 M7861 7 
H38447 N8009a BE549719 BE5S09S2 AW00SS46 AI332686 AI928348 N49234 R4407S AI694943 AI8S8538 AI2g0722 BES507Sg R43116 H40212 

H40089AA01B091 

AA648502 AA814365 AW976711 AA746117 

L48489 AL022312 D1 3789 AI761 974 AW1 73260 AW271715 AA837437 A1076278 AI36701 2 AI953032 AI5711 73 H44868 AA743691 H47026 
AA837368 AA829826 AA713585 AW50261 8 AW500856 AW501 353 AW499765 AA3391 25 H1 9141 H29545 R18883 AW450375 AA326081 
AW406015BE263669N52684A 

AW295871 AI005144 AA909877T52634 AI239684AA87S959 BE171353 AI767633 AW510907 AI742007 

AA001350 AA203114 H83070 R00660 

AI732707AI742120 

AW118072AI531982T15734AA224195AI70145BW20198 F26326AA890570 N90552AW071907AI671352A1375892 T03517R88265A1124088 
AA224388 AI084316 AI354686 T33652 AI140719 AI72021 1 T03490 AI372637 T15415 AW205836 AA630384 T03515 T33230 AA017131 AA4433C3 
T33623 

R86701 R84600AL157655^ 



TABLE 22Q 

Pkey: Unique number corresponding to an Eos probeset

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham, et al." refers to the publication entitled "The DNA sequence of human chromosome 22" Dunham, et al. (1999) Nature 402:489-495

Strand: Indicates DNA strand from which exons were predicted

Nt_position: Indicates nucleotide positions of predicted exons
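As an illustration of how the Nt_position column of Table 22Q can be read, the following is a minimal Python sketch that splits an entry such as "34896-35021,41078-41197" into its predicted exon ranges. The function names are hypothetical, and the length calculation assumes that both coordinates of each range are inclusive, which the table legend does not state.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split a comma-separated 'start-end' list into (start, end) integer pairs."""
    exons: List[Tuple[int, int]] = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate a stray trailing comma
        start_text, end_text = part.split("-", 1)
        start, end = int(start_text), int(end_text)
        if end < start:
            raise ValueError(f"exon end precedes start: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)
```

Entries damaged during text extraction (for example, ranges containing letters) raise a ValueError from `int()` instead of being silently guessed at.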



9931670 
9796703 
9801071 
9844011 



400772 
400833 
400836 
400843 
400845 



401118 
401134 
4011BD 
401215 
401281 
401454 
401488 
401507 

401542 

401736 



Plus



9188605 Plus 



7341775 Plus 



8072607 
9280797 
3219338 



Nt_position

105529-105760 

37240-37774 



81941-82434 
93178-93429 
51524-51786 
84237-84398 

34896-35021,41078-41197 

187599-188138 

677-1188 

5863-5970,7653-7784,8892S023,9673-9B07,10634-10789,15254-154( 

34428-34612 

39310-39474 



111939-112126 

51210-51406 

150981-152128 



13622-15130 
114659-114832 

64523-54686,55364-55451,66737-55846,58047-58176,68261-58356 

71055-71259 

46835-47126 

141195-141281,142217-142340 



402115 
402171 
402183 
402191 
402207 
402209 
402211 
402338 
402393 



402497 
402632 
402651 
402665 
402758 



8576073 
8576119 
8576119 



9797775 Plus 



62703-63179 
101750-102018 
79357-79514,83258-83475 
100618-104298 



41683-41851 



67414-68229 
36915-37250 

19813-20084.20163-20263 
41-631 

108901-109254,110246-110S81,113613-113960 

8818-8952 
106204-106535 



101166-101419 
174215-174380 
11824-12090,14290-14544 
87638-87924 
403379

403672 

403771 
403949 
404199 
404527 
404523 
404596 
404676 
404679 
404716 



8217451 
7263904 
7406502 
7406502 

7630897 
7656833 

9438244 
8736093 
7283286 



9797204 
9797204 
9838068 
7706327 
5103013 
73B21B9 



7230116 
3810573 
2078469 
2078469 

8439781 

8439781 
1054740 
3820491 



136829-136952,137336-137521 



406432 
406441 
406458 
406495 



9256130 Plus 



7711328 
7711328 
7711474 
7711478 
B224230 



145965-146257,150876-161368 
140-276 

361-474,541-687 
""'-72-162739,164442-164540 



96600-96S81 ,96951-97280,97393-97594 

93303-93938 
112901-113045 
1731-1941 
1669-2740 

127737-127796,128080-128210.129888-130054,132545-132869 
104807-106043 



125964-126092126691-127011,127774-127893 
123145-123417 

100933-101083,101580-101782 



38980^9111 

47657-47766,48461-48596 

153074-153343,154501-154598,156879-156999,158863-159051,169910-160053,161109-161229,163035-163131,165163- 
165259,165868-166003,167375-167552,169252-169364,171127-171281 

172005-172175 
124010-124183 
33200-33646 

161628-161734,162823-163014,164439-164652 
21701-21844 

56765-57010,57696-58016 



50425-50876 
125320-125482 

3804-3930,4026-4120,4929-5109 



145874-146911 
174661-174978 

178947-179264,181779-182087 

11703-11860,14711-14829,14920-14984,16232-16448,16916-17087 

3519»5367,38229-38476,4a08(Ma216,43522-43840 

2117-2257,2436-2540 

36430-36562 



Table 23A lists about 779 genes up-regulated in ovarian cancer compared to non-malignant adult ovaries. These were selected as for Table 20A, except that the ratio of
'average' ovarian cancer to 'average' normal ovaries was greater than or equal to 4.0, the 'average' ovarian cancer level was set to the 93rd percentile value amongst various
ovarian cancer specimens, the 'average' normal adult tissue level was set to the 93rd percentile value amongst various non-malignant adult ovaries, and the 'average' ovarian
cancer value was greater than or equal to 200 units.
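The three numeric criteria stated for Table 23A can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names, the dictionary layout keyed by Pkey, and the linear-interpolation percentile are hypothetical, and the further criteria inherited from Table 20A are not reproduced here.

```python
from typing import Dict, List, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (an assumed convention)."""
    if not values:
        raise ValueError("percentile of an empty sequence")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def select_upregulated(
    cancer: Dict[str, List[float]],
    normal: Dict[str, List[float]],
    min_ratio: float = 4.0,
    min_cancer_level: float = 200.0,
    pct: float = 93.0,
) -> Dict[str, float]:
    """Return {pkey: ratio} for probesets meeting the stated thresholds."""
    selected: Dict[str, float] = {}
    for pkey, cancer_values in cancer.items():
        normal_values = normal.get(pkey)
        if not normal_values:
            continue  # no normal-ovary measurements for this probeset
        cancer_level = percentile(cancer_values, pct)
        normal_level = percentile(normal_values, pct)
        if normal_level <= 0:
            continue  # ratio undefined; the source does not say how this is handled
        ratio = cancer_level / normal_level
        if ratio >= min_ratio and cancer_level >= min_cancer_level:
            selected[pkey] = round(ratio, 1)
    return selected
```

Using a high percentile rather than a mean for both groups makes the ratio conservative on the normal side, since a gene is only selected when even the higher-expressing normal ovaries fall well below the cancer level.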

TABLE 23A: 

Pkey: Unique Eos probeset identifier number 

Ex. Accn: Exemplar Accession number, Genbank accession number



Title: UniGene gene title
Protein Dom.: Predicted protein domain 







al ovaries 












UGID 


Title 


Protein Dom.




421296 


NM_002666 


Hs.103253 


perilipin 


perilipin,SS


37.8 


437897 




Hs.146170 


hypothetical protein FLJ22g6g 


SS,TM,zl-DHHC 


29.2 


453028 


AB006532 


Hs.31442 


RecQ protein-like 4


DEAD,hellcase_C,Foi1(_head 


27.6 


441021 


AW57B716 


Hs.7644 


H1 histone family, member 2




27.2 


422310 


AA316622 


Hs.98370 


cytochrome P450, subfamily IIS 


SS,TM,pkinase,fn3,ig 


26.5 


454017 


AW023617 


Hs.347130 


hypothetical protein FLJ22709 


SS,TM,myosin_head,l«,DAG_ 


25.9 


438424 


AI912498 


Hs.26895 


hypothetical protein FLJ14996


SS,TM 


25.8 


435017 


AA336522 


Hs.12854 


angiotensin II, type 1 recepto 




26.0 


409518 


BE384836 


Hs.3454 


KIAA1821 protein 


SS 


23.3 


410418 


1331382 


Hs.63325 


transmembrane protease, serine


SS,TM,ldLtecepLa,trypsi 


22.8 


439924 


AI985897 


Hs.125293 


ESTs 


88 


22.7 


446374 


AA3292S6 


Hs.24756 


ESTs, Moderately similar to al 




22.6 


431773 


BE409442 


Hs.268657 


pleckstrin homology-like domai


PH,SS,Llli4,Tropanin 


21.4 


420839 


AI792682 


HS.28296C 


hypothetical protein MGC10870


SS.(3S.UPF0139.GlycoJiydro 


21.4 


413436 


AF238083 


Hs.68061 


sphingosine kinase 1


DAGKc 


21.2 
423767
423552 
422179 
441356 



451643 
434513 
413244 



444804 
406620 
421495 



416819 

406918 
4C7605 
447304 
402365 
407757 



423801 
430397 
411570 



412674 
400460 
417695 



427448 
406230 
432143 
433573 
413726 
431974 
428167 
450461 
412738 
445434 
444008 
444410 
444607 

401210 
434743 
434030 
450029 
439632 



405371 
456741 
458130 
456977 



458216 
451721 
421445 



BE614743 Hs.14fi688 

140027 Hs.11 8890 

BE04S223 Hs.136912 

H03tl7 Hs.111497 

AW139399 HS.9B98B 

AA977935 Hs.127274 

HI 8283 Hs.1 32753 

AF052122 Hs.130712 

AF091619 Hs.1 12567 



ESTs 



oxonlypri 



H08379 Hs.1 55553 

AW1 63034 Hs.5457 

M64437 Hs.234799 

H55995 Hs.37372 

AW955951 Hs.1 59255 

AW451623 Hs.l 09752 

AF161254 Hs.103!96 

HU 012391 Hs.79414 

BE315042 Hs.19210 

U24266 Hs.77448 

AI084452 Hs.22153 

M81105 Hs.146550 

AI5B3067 Hs.149152 

AA4555ee Hs.62405 

C00138 Hs.8535 
M14624 

AA324251 Hs.78950 
AW995839 

U77735 Hs.80205 

AW613841 Hs.301394 
U88357 

W03512 Hs.6479 



W16398 Hs.38628 

AF174487 Hs.293753 

AA831893 Hs.292767 

NM_016071 Hs.1 32942 

AI924633 Hs.105607 

BE144584 Hs.314341 

AI004525 Hs.14587 

X04106 Hs.74451 

AA424317 Hs.6259 

AA433988 Hs.98502 

BE061914 Hs.10844 

A1697160 Hs.143594 

AW663317 HS.6SS88 

W25317 Hs.37616 

R35917 HS.30133B 

BE246449 Hs.2157 

AL040133 Hs.123484 

AF234887 Hs.57652 

AJ278465 Hs.75510 



Homo sapiens clone 23929 mRNA 
dynein, axonemal, intermediate 
ESTs, Weakly similar to JC5024 
hypothetical protein FLJ20253 
hypothetical protein DKFZp434N 
synaptogyrin 3 
breakpoint cluster region 
Homo sapiens DNA binding pepti 
kruppel-related zinc finger pr 
putative c-Myc-responsive 
SD6 antigen 

prostate epithelium-specific E 
hypothetical protein MGC11303 
aldehyde dehydrogenase 4 famil 
Target Exon 

hypothetical protein FLJ21988 
myosin, heavy polypeptide 9, n 
ESTs, Weakly similar to RHOPM 
hypothetical protein FLJ22573 
Homo sapiens mRNA for KI/VM668 
gb:Human 4-beta-galactosyltran 
branched chain keto acid dehyd 
gb:QV4-BN0044-1102013-108-h07 B 
pim-2 oncogene 
hypothetical protein MGC3101 
gb:Homo sapiens DNA-binding pr 
hypothetical protein MGC13272 
phosphatidylinositol glycan, c 
Target Exon 
hypothetical protein 
Bcl-2-related ovarian killer p 
hypothetical protein FLJ23109 
GTPase regulator associated wi 
bicarbonate transporter relate 
ESTs 

Eos Control 

ESTs, Weakly similar to AF1518 

calpain 4, small subunit (30K) 

C11002253*:gi|129091|sp|P23267 

KIAA1698 protein 

CA125 antigen; mucin 16 

Homo sapiens cDNA FLJ14476 fis 



Human D9 1 
hypothetical protein FLJ12587 
Wiskott-Aldrich syndrome (ecze 
Target Exon 

Homo sapiens, clone IMAGE:4178 
cadherin, EGF LAG seven-pass G 



Hs.200934 ESTs 



AA770021 Hs.16332 

BE408081 Hs.46736 

N34731 Hs.74662 

BE391690 Hs.9265 

BE544855 Hs.236572 

BE387360 Hs.33719 

AW405635 Hs.293687 

AI363410 

AW152336 Hs.3709 

AW073380 Hs.267963 

AW410714 Hs.334437 

Y19188 HS.32D461 

AF039ig6 Hs.272367 

W37608 Hs.184492 

AA115811 Hs.6838 

AK000262 Hs.1 69768 

BE258876 Hs.94446 
BE246743 

Z80345 Hs.1 27610 

AWC24282 Hs.104938 

NM.006946 Hs.26915 

AA913059 Hs.104433 



SS,pkinase,PMP22_aaudin 



TM,DnaJ,UBA,ArfGap,homeob 
Synapti:^/rin,SS,TM,PDZ,WD 
RhoGEF,RhoGAP,PH,C2 
SS 

SS,TM,BTB,Pep_M12B_propep 



myosin.head,Myosin_tail,l 



pkinase,SS,TU,OTU,K_tetra 
zf-C2H2,$$ 



ESTs 

hypothetical protein FLJ23476 
siah binding protein 1; FBPin 
hypothetical protein FLJ20917 
ESTs, Weakly similar to SFR4JH 
ESTs, Moderately similar to 86 
ESTs 

C7001 735*:gi|7768635|dbi|BAA95 

G12000519:gi|7710046|reflNP_05 

ribosomal protein S18 

low molecular mass ubiquinone- 

hypothetical protein FLJ10535 

hypothetical protein MGC4248 

ESTs 



ras homolog gene family, membe 
hypothetical protein FLJ20245 
polyamine-modulated factor 1 
hypothetical protein FLJ22635 
acyl-Coenzyme A dehydrogenase, 
hypothetical protein MGC15906 
spectrin, beta, non-erythrocyt 
Homo sapiens, clone IMAGE:4054 



SS,PeptldaseLC2 

SS,SS,TM,lg 

SS,zf-CCCH 

zf-C3HC4,TM,SuIfate.trans 

RhoGAP,SH3,PH 

HC03_coUansp,SS,TM 

SS,SS,Glyco_tranL43,COLF 

SS,TM,SS,TM 

eniand,SS,CAP_GLY 

SS,TM,SCAN,zf-C2H2,KRAB 

SS,TM,Glyco_hydioJ1,GIyo 

88 

SS,SS,TM,Sema 



SS,PWWP.Exonuclease,llpoc 

SS,SS,SAC3_GANP 

88 

SS,R-PLC-X,PH,PI-PLC-Y.C 



SS,Pyridox^o!(idase,zl^2H 



aldo_keLted,SS,TM,gla 
SS,TM 

Acyl-CoA_dh,Acyl-CoA_dh_M 
427640 
424618 



408294 
442232 
416866 
419823 
422625 
401264 
407507 



447867 
442472 
455588 



407363 
414413 
431765 
421694 
453683 
418736 



BE046956 Hs^51673 

NM_012068 Hs.975t 

M723157 Hs.73769 

AL031685 Hs.12785 

AF058293 Hs.180015 

AI589535 Hs.94875 

L29472 Hs.1802 

AW139476 Hs.31240 

AA827718 Hs.88218 

AW451101 Hs.169489 

BE408369 Hs,43621 
BE141732 

AI357813 Hs.337460 

AA297366 Hs.80324 

AW271708 Hs.118918 

AW504598 Hs.1 55976 

U73799 



421901 
441353 
443801 



447151 
431898 
454291 



DNA (cytosine-5-)-methyltransf 
activating transcription U 
folate receptor 1 (adult) 
KIAACg39 protein 
D-dopachrome tautomerase 
ESTs, Weakly similar to A35363 
major histocompatibility compl 
ESTs 



lartoJC 



I HS.33S742 ESTs 



ESTs 
ESTs, U 

Homo sapiens, Similar to hypot 
gb:QV0-HT0101-061099-032-e07H 
ESTs, Weakly similar to A47582 
serine/threonine protein phosp 
ESTs, Weakly similar to M20M_H 
cullin 4B 

C1800009C":gi|6678556|ref|fiP_0 
gb:Hiiman dynaclm mlRNA, partia 
CI 1000890:gi|37464431gb|AAC639 



SS,P'/VWP,PHD 

bZlP,NTP_transfJ,SS,TBC 

Folate_rec,SS 

SS,TM.Na_H_Exchanger,ABC2 
MIF,lale_protein L2,SS,GS 
S8 

TM,lg,MHCJUieta.SS.TM/i 



SS,SS,arf,ras,fii3,ias 



D30934 Hs.29549 

NM_001381 Hs.103854 

AI929SB5 Hs.177656 

AW583672 Hs.256311 
AA812046 

A1929377 Hs.173724 

AI525268 Hs.164303 
AW806859 

All 29903 Hs.74669 

AW247735 Hs.101617 

AA454184 Hs.289014 

AA554920 Hs. 105794 

W03445 HS.382C6 

BE559681 Hs.30736 

AF035032 Hs.181125 
BE294877 

AF1 24249 Hs.268541 

BE387430 HS.10688C 

AL079854 H3.1 18598 

T18979 Hs.87908 



C-trpelectMkereceplor-i 



lectinj:,SS,TM 

PH,IRS,TM,PH,IRS,lijpsln, 

efhand,RrnaAD,SS,efhand 



ESTs 



U66048 Hs.92683 
D60945 

AJ261016 Hs.89230 

A1742071 Hs.133205 

AA186733 Hs.292154 
Y10141 

AW173191 Hs.213117 

AK001383 HS.11638S 

AK002194 Hs.19851 

AW958453 Hs.204959 

BE439952 Hs.1 961 77 

AK001502 Hs.91753 

AW245825 Hs.211914 

AW452434 Hs.58006 

N49809 Hs.1 1197 

U38291 Hs.194301 

AB014554 Hs.109299 

AW450211 Hs.1 26825 

AW206942 Hs.253594 

AW004968 Hs.236720 

AI670823 Hs.85573 

AK001035 Hs.130881 

U51333 Hs.159237 

AA321980 Hs.204532 

AA639613 Hs.325116 

AW505368 Hs.1 2460 

D82070 Hs.1 77972 

BE268854 Hs.1 77729 

AW161050 Hs.169511 

AA698869 Hs.173770 

AI811865 Hs.7133 
AL117488 

AL133632 Hs.127808 

AI022813 Hs.92679 

AK000020 Hs.272013 

AW384847 Hs.213534 

AA954810 Hs.239784 

NM_002314 Hs.36666 

AW409701 Hs.1578 

AK000664 HS.1642S6 

BE464016 Hs.238g56 



gb:MR)-ST002(H)81199JXM-c03 S 
vesicle-associated membrane pr 

ESTs, Weakly similar to T32527 
ESTs 

UDP-glucose:glycoprotein gluco 
gb:za05g11.r1 Soares melanocyt 
KIAA0124 protein 

gb:Homo sapiens clone MCA1 L my 
gb:601174162F1 NIH_MGC_17 Homo 
novel SH2-containing protein 1 
bystin-like 

Homo sapiens mRNA for KIAA1878 
ICBP activator pro 
ins mRNA; cDNA DKFZp4 



gb:HUM1 41 D04B Clonlech human f 
Target Exon 
potassium ir 



SS,helicase_C,AT_hook,SS, 
SS,TM 

TM,CaMBD,SK_cliannel,TM 



le. partia SNF.SS.TM 



hypothetical protein FLJ10521 
peroxisomal biogenesis factor 
hypothetical protein FLJ14886 



hypothetical protein 
ENSP0000023362r:NADH-ubiquino 
ESTs, Weakly similar to ALUS H 
Homo sapiens, clone IMAGE:3343 



pklnase,SS,SNF2.N,hellcas 



ESTs, Weakly similar to A46302 



hypothetical protein MGC10911 
B-cell CLL/lymphoma 11A (zinc 
Target Exon 
hexokinase 3 (white cell) 

ESTs 

Homo sapiens, clone MGC:2962, 
gb:UI-HF-BNO-alu-d-0J-O-Ul.r1 
chromosome 4 open reading fram 



Homo sapiens, clone IMAGE:3161 
Human clone 23564 mRNA sequenc 
Homo sapiens mRNA; cDNA DKFZp4 
Homo sapiens clone CDABP0014 m 
hypothetical protein FU2a01 3 
ESTs, Weakly similar to MUC2_H 
human homolog of Drosophila Sc 
LIM domain kinase 1 
baculoviral IAP re 



SAM,SS,TM,iTm,PDZ 
SS,TM,HSP20,7tm_1 
GATA 

SS,MATH,zf-TRAF,zf-C3HC4 



hexoklnaschexoKnase^he 



SSTM,ABCJran.ayco_lran 



SS,TM,LRR,amlnotranJ.2 



BIR,TK,SS,TM 
SS,zf-C2H2mn 
427461 
448993 
443136 
427725 
400923 
419767 



436543 
431811 
414534 
455885 
427721 
430432 

450334 
413564 
410397 
439539 
400286 
416472 
416641 
419492 
420970 
406495 
448043 
401724 
424263 
428092 
453023 
400137 



401674 
412289 
424198 
412173 



AW245185 Hs.6996 ESTs 

AW403954 Hs.25237 mesenchymal stem cell protein 

AI928242 Hs.293438 ESTs, Highly similar to AF1984 

D17629 Hs.159479 galactosamine (N-acetyl)-6-sul 

BE146961 gb:QV4-HTC222-011199-019-b12 H 

AI34B856 Hs.21627 gb:tb05a05.x2 NCI_CGAP_Lu26 Ho 

AK001237 Hs.319088 hypothetical protein FLJ10376 

AA531527 Hs.33204C hypothetical protein MGC13010 

AI471530 KIAA0144 gene product 

NM_001440 Hs.9018 exostoses (multiple)-like 3 

U66839 Hs.180533 mitogen-activated protein kina 

Target Exon 

AA77382C Hs.63970 ESTs 



AA829286 Hs.332063 

BE513301 Hs.29344 

NM_002212 Hs.5215 

AB040972 Hs.301696 

BE257293 Hs.76366 
BE153524 

AI5B2B43 Hs.1804SS 

AB037758 Hs.2414ig 

AW139032 Hs.107376 

AF03595g Hs.24879 



AA180756 Hs.340316 

BE243136 Hs.86947 

AA243547 Hs.l9447 

AA305079 Hs.1342 



serum amyloid A1 
hypothetical protein, clone 24 
integrin beta 4 binding protei 
hypothetical protein FLJ11560 
BCL2-antagonist of cell death 
gb:PM04HT0339-241199^2.C03 H 
RAD23 (S. cerevisiae) homolog 
KIAA1337 protein 
hypothetical protein DKFZp434N 
phosphatidic acid phosphatase 
gb:60114599QF1 NIH_MGC_19 Homo 
DKFZp564J157 protein 



PD2-LIM protein mj 
cytochrome c oxidase sul 
Target Exon 



425041 
448340 
406779 
431006 
421273 
409649 
430281 
444672 
405928 
421321 
439905 
451937 
425575 
43B527 
438951 
421758 

405346 
432746 
452798 



AWC28733 Hs.31439 

W94824 Hs.11565 

AA101325 Hs.86154 

NM_014404 Hs.278907 

BE272446 Hs.265317 

AL050341 Hs.37165 

AI620463 Hs.347408 

AW935967 Hs.170162 

AB029010 Hs.143026 
T71071 

AI467908 Hs.8882 

AI907018 Hs.15977 

AW247529 Hs.6793 

AI377150 HS.1509M 

AI492910 Hs.32362 

AA412048 Hs.279574 

AA490544 Hs.1 27269 

AJ245416 Hs.103106 

AA159216 Hs.55505 

AI878842 Hs.237924 



Eos Co 



se inhibitor. Kun 



415056 
454252 
409227 
413908 



NM_C05309 Hs.103502 

AW799755 Hs.1 10953 

AFlig664 Hs.272g9 

AW084791 Hs.133122 

AIC87335 Hs.123473 

U51336 Hs.5453 



AA554512 Hs.24301 

A1918771 Hs.257170 

AA854219 Hs.348137 
BE561688 

AI627912 Hs.130783 

BE258857 Hs.27744 
AF151536 

N52572 Hs.13702 

AW952776 Hs.94943 

AW502869 Hs.65219 



SS,SAAjiroteins,SS,SAA_pr 



RIKEN cDNA 2010100012 gene 
hypothetical protein FLJ12457 
calcium channel, voltage-depen 
hypothetical protein MGC2562 
collagen, type IX, alpha 2 
hypothetical protein MGC13102 
C160014ir:gl[75(Xl34S|plr|[T21 
KIAA1357 protein 
KIAA1087 protein 
gb:yc50b05.r1 Stratagene Pver 



ESTs 
ESTs 

CGI-39 protein; cell death-reg 
ESTs, Weakly similar to T02345 
U6 snRNA-associated Sm-like pr 
hypothetical protein FLJ20442 

laminin, alpha 5 
Target Exon 

glutamic-pyruvate transaminase 
retinoic acid induced 1 
transcriptional regulator prot 
hypothetical protein FLJ14524 
ESTs 

inositol 1,3,4-triphosphate 5/ 
Gardner-Rasheed feline sarcoma 
Homo sapiens mRNA; cDNA DKFZp4 
Rag C protein 

polymerase (RNA) II (DNA direc 
ESTs 



RAB3A, member RAS oncogene fam 
ubiquinol-cytochrome c reducta 
ESTs, Moderately similar to AL 



PAP2,SS 

SS,(iomeobox,UPF0160,DUF23 
SS,Forlehead 

TM,ABC_tran,ABC membrane 
zf-C2H2 

disinlegrin,Reprolysin,Pe 

LIM,SS,SH3,SorttMetalloph 

C0X6B 

SRCR,TM,Acelyllransf 
PHD 

TM,PI^T,SS 

fn3,lg,ll«<,SS,TM,fn3,ig,R 

SS,TM 

Kunitz_BPTI,SS,TMjon_tra 

Corona_7,SS,TM 

UPP_synthelase,HMG14_17 

PMP22_Claudln,SS,TW,PMP22 

Ss!coll^n,SS,Collagen 
SS,TM,gla 

FAD^)xidase_C,FAD_blndlng 

SS,TM,Na.Ca Ex,Calx-bela, 

CPSase_Lcha!n 

SS,TM,7tm_1 



WO40 

Sm,SS,tfWA-synlJ,GST_C,G 

Y_phosphatase,DSPc,TU 

mito_oarr,SS,TM 

laniinin_EGF,laminin_G,EGF 

SS,cystatin,Coprogen oxid 

aminot;an_1_2,SS,TM,LRR 

SS,integrin_B,fn3,Cali<-be 
SS,TM,aminotran_1_2 

SS,oxidored_nitro,SS 
SH2,SH3,pI(inase 



RCCl 



adenosine A1 receptor 



SS,TM 

7tm_1,SS,TM 

SS,TM,voItage_CLC,CBS 



BE409966 Hs.323813 
407299 
422837 
407722 
417810 
446333 
402197 
419390 
447754 
444664 
421190 
432872 
430023 
413343 
417852 
40312B 
413056 
427812 
457761 
463099 
426048 



441197 
421707 
435750 
432353 
427326 
447128 
419444 
457978 
410445 
431857 
407143 
408724 



414927 
434314 
414157 
424415 
406487 
447365 
417900 
442297 
425440 
418256 
431543 
430344 



456960 
461481 
434357 

433333 
430600 
409034 
421642 
431534 



AW674193 Hs.227162 

AA234540 Hs.23871 
AI222969 

778141 Hs.22826 

AW968343 

AWB51734 

W05238 Hs.94316 

AW732847 Hs.70573 

AF129086 Hs.25197 

AW406289 Hs.96593 

N84730 Hs.260616 

AA661771 Hs.3076 

AA081594 Hs.156311 

AW071696 Hs.209065 
AWB16029 

AA460205 Hs.28977C 

U26441 Hs.1 21478 

BE252241 Hs.38041 

D28419 Hs.82609 

BE537641 Hs.44278 

AI701162 HS.902C7 

AW073310 Hs.163633 

N26362 Hs.11615 

U95031 Hs.102482 

AI908984 Hs.278623 

AA1S8243 Hs.227729 

BE392026 Hs.334346 

AJ250562 Hs.82749 



H62087 
AI768853 
H9686C 
AI524849 
BE244638 
NM_014921 HS.1C7C54 
AB029012 H$.4990 
NM_016558 Hs.274411 



ESTs 
ESTs 

ESTs, Weakly similar to 155214 
DKFZP434I1735 protein 
gb:MR2-CT0222-011199-007-€lC C 
ESTs, Weakly similar to T31613 
PKCI-1-related HIT protein 
STIP1 homology and U-Box conta 
hypothetical protein 
isocitrate dehydrogenase 3 (NA 
ESTs 

Musashi (Drosophila) homolog 1 
hypothetical protein FLJ14225 
gb:MR3^T0220-151299^27-b10 S 



SS,TM,SS,TM,ClattirinJg_c 



hypothefcal protein FU1253f 
Target Exon 

hypothetical protein MGCm: 
intronofHER4 



much S, sub^e B, tracheobro 



Hs.344612 



AI271898 

NMJ02496 Hs.90443 

AA776638 

AA199830 

W19144 Hs.271742 

C14076 Hs.332329 

AI685842 Hs.294143 

W28661 HS.S288 

AI791499 Hs,20S742 

AB028989 Hs.8eSCC 

T83587 Hs.186476 

BE392g21 Hs.3797 

BE2978C1 Hs.103845- 

NM_001975 Hs,146580 

BE383676 Hs.334 

BE250127 HS.829C6 

NM_006202 H3.899C1 

BE382756 Hs.l699C2 

AW845318 Hs.12271 

AW969519 Hs.259768 

AA476827 Hs.l71012 

AW410063 Hs.184877 

API 11 170 Hs.306165 

AA300228 H3.296866 

AW732284 Hs.3828 

AL04C536 Hs.9573 

AI016621 Hs.71816 

AW950967 Hs.274348 

AI684149 Hs.1 72036 

AA411607 Hs.1 18964 

AL1 37531 Hs.258890 

AF231023 Hs.65173 

NM_016240 HS.1288S6 

AA932860 Hs.133864 

NM.0a4729 Hs.gg33 

BE313765 Hs.343443 

AW362198 Hs.12503 

AK001822 

R06185 



ESTs 

gb:yw03bI2.s1 Soares melanoc 
ESTs, Weakly similar to NEL1J 
sterol regulatory element bind 
lectomedin-2 
KIAA1089 protein 
SCAN domain-containing 1 
gb:qv23ro&x1 NCI_CGAP_Lym6 Ho 
cyclin K 
Target CAT 

gb:ae78g04.s1 Stratagene schiz 
gb:zq75li01.r1StratageneliHrn 
ADP-ribosyltransferase (NAD; p 
EST 

ESTs, Weakly similar to T22914 
Homo sapiens mRNA; cDNA DKFZp4 
ESTs, Weakly similar to ALUA_H 



SS,TM,eniand,elliand 
SS,TM,DEAD,hellcase_C,Larr 



7tm_1,SS,TM,7tm 1 
pfkB,SS 

Porphobil_deam 



DSPc,Rliodanese,SS.TM 
Cys_knot,wi«: 

DUF25,SS,l^bosanaLL3,PDZ 
SS,FKBP.SaPDGF.C2,PI-PLC 
SS,DnaJ 

transmembrane4,SS,TM 

SS.TM,tubuiin,EGF,F5_F8_t 

p450 

SS 

SS,UM,SS 

SS 

TM 

SS,TM,SS,TM,DDOST_48kD 



Utrophilin,OLF,7tmJ,Gal 

SS,TM 

SCAN 

SS,TW,7tm_1,SS,TM 



Target Exon 
Rho guanine nucleotide exchang 
CDC20 (cell division cycle 20, 
phosphodiesterase 4A, cAMP-spe 
solute carrier family 2 (facil 
f-box and leucine-rich repeat 
adenylate cyclase 1 (brain) 



ESTs, Weakly similar to 138022 
Target Exon 

interieuklnlSteceptar.alplia 
Homo sapiens cDNA FLJ10960 fis 
gb7e94d03.r1 Soares fetal liv 



SS.Thl,pklnase/cUvinjec 



enolase,SS,Alrophln-Ulr 
SS,TM 

SH3,PH,IRhoGEF 
WD40,SS.TM,lh3,EGF,fn3,ig 



solute carrier family 25 (mito 
Target Exon 

Homo sapiens 14q32 Jagged2 gen 
hypothetical protein DKFZp434N 
mevalonate (diphospho) decarbo 
ATP-binding cassette, sub-fami 
v-akt murine thymoma viral onc 
HLA-B associated transcript-3 
hypothetical protein similar t 
ESTs, Weakly similar to KIAA11 
Homo sapiens mRNA; cDNA DKFZp4 
cadherin, EGF LAG seven-pass G 
CSR1 protein 
ESTs 



mito_carr,SS,TM,profilin, 



GHMP.kmases,SS,TM 



SS,TM,[as 

SS,TM,7tmJ,cadherin,GPS, 
CollageaSS 

zf-BED 

SS,TM,Yjhosplialase,LON,A 

UPF0027 

SS,sushi,SS 
hypothetical protein 

BISPra)0()oS':CDfJ«!FU2()404 
gb:MR0-BT024M91299-201-c07 B 
gb:UI-HI^BNMs-a-1(M)-Ul.r1 



RhoGEF,PH,SS,zf-CCCH.vwd 



459133 
404757 
409879 
411219 
421871 
434067 



421543 
407757 
419125 
437141 



422708 
417442 
432751 
432004 
454151 

417677 



447827 
409125 
450437 
415514 



414757 
436014 
421696 

408015 



408215 
408244 
436041 



436540 
427747 
441456 
431630 



457310 
417193 
432545 



BE272686 Hs.15356 
AA146625 

BE409261 Hs.12084 



BE304917 HS.31C97 
AV655783 Hs.661 
AB00619C Hs.25475 



cyclin-dependent kinase inhibi 

hypothetical protein MGC2668 
gb:QV2-TT000J-161199J13-h06 T 
glycoprotein, synaptic 2 
Homo sapiens cDNA FLJ14218 fis 
hypothetical protein FLJ10116 
hypothetical protein FU2C2S4 
gb:zo71c07.s1 Stratagene pancr 
Tu translation elongation fact 
hypothetical protein FLJ20512 
hypothetical protein MGC5395 
B-cell CLL/lymphoma 11A (zinc 
hypothetical protein FLJ21478 
Target CAT 



ESTs 



rin7 



^s.154C88 I 



AA047169 
BE299427 

NM_01605S Hs.82389 

NM_001089 Hs.26630 

AW749126 Hs.170345 

R41933 Hs.140237 



AF044924 
AF086161 
AL045934 



Hs.114( 



X13966 Hs.24998 

F11301 Hs.138329 

BE383605 Hs.300816 
U24683 

AM19008 Hs.106730 

f«9096 Hs.279939 

Ai219207 Hs.72222 

BE410S73 Hs.283636 

046922 Hs.77252 



AW136771 Hs.244349 

AI702901 Hs.145582 

NM_014931 Hs.72172 

AW295349 Hs.8038 
M79091 

AWC03512 Hs.232770 
R39108 



ESTs 

small GTP-binding protein 
immunoglobulin heavy constant 
chromosome 22 open reading fra 
mitochondrial carrier homolog 
hypothetical protein FU134S9 
Homo sapiens, clone IMAGE:4053 
fragile histidine triad gene 
exosome component Rrp46 
Homo sapiens clone 23771 mRNA 



toFOR4M 
KIAA11 IS protein 
ESTs 

gb;EST01239 Subtracted Hippoca 



Hs.11067 ESTs, Highly similar to T46395 
BE258332 Hs.278362 male-enhanced antigen 



A1498080 

AMI 6856 

BE397032 

AW411425 

AI458911 

I>1M.C02204 

R43144 



Hs.180655 
Hs.1 27765 
Hs.265829 
Hs.21919 
Hs.18376 
Hs.10041 



BE299671 HsiS6310 

AI879159 Hs.108219 

NMJ01327 Hs.157379 

W28363 Hs.239752 



AI279811 
AA706639 

NM_007275 Hs.8186 



hypothetical protein MGC14226 
serine/threonine kinase 12 
ESTs 

integrin, alpha 3 (antigen CD4 



C7002191*:gi|S0S3028|gb|AAD388 
likely ortholog of mouse ZFP28 
wingless-type MMTV integration 
cancer/testis antigen (NY-ESO- 
nuclear receptor subfamily 2, 
hypothetical protein FLJ22795 
uracil-DNA glycosylase 2 
Homo sapiens, clone IMAGE:3953 
gb:ag90eC9.r1 Stratagene hNT n 



CGI-118 protein 
ATP-binding cassette, sub-fami 
hypothetical protein FLJ13710 
ESTs, Weakly similar to ALU1_H 
Hs.30792 hook2 protein 

hypothetical protein FLJ11806 
gb:DKFZp434M116_r1 434 (synony 
C16001614:gi|7801278|emb|CAB91 
protein tyrosine phosphatase, 
axonal transport of synaptic v 
hypothetical protein MGC10471 



HC03_co 
ank,SSy\dapjx)mpjsub 
TM,zf-C2H2 
SS.TM 

TM,Steroid_dh,SS 



hormone, SS,p»(B 

GTP_EFTU,GTP EFTIIJ)3,GTP 
TM 

SS.EF1G_domaln,GST_C,GST_ 
SS 

SS,TM,Glyoos_tiansf_4 
MIP,SS,TM 



PDGF,SS 

SS,TM,Glycos_transf_4 

SS,DlX,PDZ,DEP,Dlshevelle 



bZlP,SS,AhpC-TSA 
8S,PI3_PI4J(inase,PI3Ka 



SS,TM,TPR 
SS 

SS,TM,UBA,Rhcniboid,SS,TM 
SS,fM,efhand 

S8,TM,Exo_endoj)lios,BNR,A 
HIT 

RNaseJ>H,RNaseJ>H.C,SS,TG 
SS,TM,efhand,elhand 



SS,TM,DUF60,trypsln,CUB,u 



integtin_A,FG-GAP,Rhabd_g 
TM 

SS,Myosin.lail,M 

SS,HLH 

SaABCJran 



AI275803 Hs.123428 ESTs 
430513 

427815 

420476 
409960 
436325 
444439 
412915 






412104 
449750 
442725 



AW970440 Hs.23642 

NM_0056S8 Hs.2134 

AI815639 Hs.76394 

AW954377 Hs.26412 

AJ245433 Hs.247323 

AA622394 Hs.153177 

AJ012008 Hs.241586 

BE244700 Hs.147049 

BE072019 Hs.12851 

NM_002342 Hs.1116 

AW676863 Hs.136232 
BE261944 

AL390088 Hs.7393 

AI458883 Hs.143545 

AW087727 Hs.74823 

NM_002419 Hs.39449 

U40714 Hs.239307 

AW295956 Hs.11900 

AW138211 Hs.128746 

NM_004658 Hs.193312 

BE250337 Hs.193273 

AW206197 Hs.240951 



447598 
425858 
440611 



431615 
427204 
425169 
412940 



ENSP00000000452:BAD protein {B 
protein predicted by clone 236 
TNF receptor-associated factor 
enoyl Coenzyme A hydratase, sh 



G6C protein 

cut (Drosophila)-like 1 (CCAAT 
phosphatidylserine synthase 2 
lymphotoxin beta receptor (TNF 
ESTs 

hypothetical protein from EURO 
hypothetical protein MGC11303 
NM_004541:Homo sapiens NADH de 
mitogen-activated protein kina 
tyrosyl-tRNA synthetase 
hypothetical protein FLJ14972 
ESTs 

RAS protein activator like 1 ( 
Target CAT 

Homo sapiens, Similar to RIKEN 



Hs.131035 
Hs.241161 
HsJ01760 
Hs.172914 

428924 AI016405 Hs.98959 



rs,Weald|r,slmilarl 
A0969protdn 



Hs.195347 ESTs 



AI799968 Hs.199630 ESTs 



AW245890' Hs.21753 
NM 012152 Hs.258583 
AA405404 Hs.215725 



AI14207e HS.1355S2 ESTs 



Target Exon 

FK506-binding protein 4 (59kD) 
syntaxin binding protein 2 
ESTs, Weakly similar to AF1568 
ESTs, Weakly similar to T19486 
ESTs 



Hs,9857 
AA628062 Hs^0358 
AF151815 Hs.4973 



413764 
409169 
446933 
409139 
456672 

421909 



427527 
414265 
433933 



413163 
434962 
418572 
440869 
4S3446 
412159 
438999 



BE513294 Hs,205735 



AL137659 Hs,297214 

AI681917 Hs.3321 

AK002016 Hs.1 14727 

AI083668 Hs.50601 

NM_013375 Hs.109428 

AU077005 Hs.92208 

AJ006973 Hs.9482 

H11097 Hs.61960 

AI809067 Hs.153261 

BE410411 Hs.75864 



C7S094 



Hs.334514 
Hs,77575 
Hs.75216 
AK001574 Hs.4291 
AI751740 Hs.86172 
NM.014297 Hs.7486 
BE299996 

AF286S98 Hs.9271 



SS.Iiormone_rec,2f-C4 

SS,PX,arf,lipocalin,PHD,z 

MATH,SS,MATH,A2M_Ny\2M,NT 

ECH,Pepfidase_U7,SS,TM 

SS>TM,aLN,Cbl_NiCbLN3 

SS,TM,G-patch,ublquitln,a 

SS,TM 

SS,TM,GST_Cabhydrolase 
CUT,homeobox,beta-lactama 
SS,TM,7tm_1 

TNFR_c5,SS 
SS.HU1 



8S,TM,PAF-AH4)JI 



SS,TM 

SS.TM,Peptidase.M10,fn2,h 
SS,TM,MAGE,Ribosomal_S17, 



Oxyslerol.BP.SS 

lamiiin_Nterm,laminin_Nte 

WD40,SS.TM,KOW,HLH 



homeobax,SS,homeaboxJx]me 



.Moderately similar to AL 

hypothetical protein FLJ122B7 
C2002219*:gi|12737280|ref|XP_0 
HLA class II region expressed 
NM_021949:Homo sapiens ATPase, 
Target Exon 

proteasome (prosome, macropain 
gb:PM1-HT0454-301299-001-d08H 
(clone PWHLC2-24) myosin light 
HSPC141 protein 
ESTs, Highly similar to IRX1_H 
Homo sapiens, clone MGC:16327, 
hypothetical protein MGC10986 
TATA-binding protein-binding p 
a disintegrin and metalloprote 
target of myb1 (chicken) homol 
NM_004347:Homo sapiens caspase 
hypothetical protein 
immunoglobulin heavy constant 
endoplasmic reticulum glycopro 
Homo sapiens clone TCCaA00164 
glucocorticoid modulatory elem 
gb:EST386752 MAGE resequences, 
NM_003105*:Homo sapiens sortil 
NG22 protein 

hypothetical protein MGC3136 
protein tyrosine phosphatase, 
golgi peripheral membrane prot 
paired related homeobox protei 
protein expressed in thyroid 
gb:600944574F1 NIH_MGC_17 Homo 
KIAA1071 protein 

gb:tp66c02.x1 NCI_CGAP_Ov39 Ho 



SS,TM 

SS,TM,SS,TM,ABC_fran,ABC_ 
SS,TM,)llamenUF_lail 

kazal,SS,TM,lg,pldnase 

E1-E2JkTPase,Hydro!ase,SS 

SS,TM,MiP 

PCI,SS.COK5_aclii/alor 



SS.TH/l.ank,EGF.nolch,MATH, 
SS,homeobox 

SS,PK,PK_C,myosin_head,Rh 



disinlegrin,Reproly5in,Pe 

VHS,GAT,Tlul,Heme oxygenase 

ICE_p20,ICEj)10,CARD,SS,l 

SS,pkinase 

SS.TM,ig 

SS,TM,SS,TM,SH2,Yj.hospha 
SAND,SS 

SSAfapj»mpjsub,GYF 
!d|jec^pLa.fn3,ldLrece 
SS,TM,pkinase,SH2,SH3,BNR 

fn3jg,Y_phosphatase,SS,T 



lactamase_B,SS,XRCC1_N,BR 
418945 
440333 
425615 
458040 



447946 
447205 
416880 
440150 



416487 
413837 
419887 
410277 
415169 
410892 
407754 
409877 
431629 



AA256714 Hs.194864 

AK000405 Hs.76480 

AU)47879 Hs.80475 

AW751955 Hs.22753 

AL043054 Hs.256657 

BE246762 Hs.89499 

AI378424 Hs.288761 

AF023614 Hs.158341 

BE280562 Hs.287711 

AA088470 Hs.83135 

AA582082 Hs.199410 

AA536047 Hs.9850 

H16716 Hs.182648 

AI555164 H3.165827 

BE617015 Hs.11006 

H99640 Hs.53687 

AW975738 Hs.7001 

AF083420 Hs.168913 

Y11739 Hs.198313 

AA788727 Hs.34068 
AI498072 
BE271922 

AW190458 Hs.79347 
AW163525 

AW292562 Hs.187628 

R88621 Hs.26249 

W42913 Hs.78089 

AWB09762 Hs.222056 

AA52734a Hs.288967 

AW502498 HS.1S220 

AU077025 HS.26S827 

AB037108 Hs.6418 

R96881 Hs.63609 

BE207357 Hs.3454 



U91616 



447246 
427101 
447544 



414874 
418373 
424487 



427361 
420421 
414513 

431498 
432593 
404661 
41279C 
456243 
426222 
439594 

429049 
424271 
418741 
450493 
433074 
444893 



Hs.91640 



440104 
423279 
445087 
404036 
431832 



FU2257B 



. Weakly similar to ALU2_H 
hypothetical protein FLJ22318 
ESTs, Weakly similar to A46302 
arachidonate 5-lipoxygenase 
hypothetical protein FLJ21749 
transmembrane activator and CA 
hypothetical protein FLJ22692 
Homo sapiens, Similar to RIKEN 



ESTs, Moderately similar to T1 

EST 

Homo sapiens, clone IMAGE:3940 
serine/threonine kinase 24 (St 
winged-helix nude 
ESTs, Weakly similar to A43932 



ESTs, Weakly similar to zinc f 
KIAA0211 gene product 
titin-cap (telethonin) 
ESTs 

ESTs, Weakly similar to T2D3_H 
ATPase, vacuolar, 14 kD 
Homo sapiens cDNA FLJ11572 fis 
Homo sapiens cDNA FLJ1410S fis 
zinc finger protein 106 



NM_005781 Hs.163937 

NM_014440 Hs.278910 

AK000633 Hs.164476 

AK001713 Hs.17860 

R87591 H3.172884 

AA401573 Hs.288284 

AI751157 Hs.101395 

AW249276 Hs.343621 

D26351 Hs.77515 

AW750770 Hs.84344 

T08754 Hs.6259 
AA381642 

AF230924 Hs.107187 

NM_001348 Hs.25619 

AW732480 Hs.7678 

AF2B1133 Hs,343689 

AW239400 Hs.76297 

AK001777 Hs.258561 

AW301003 Hs.51483 

NM_014767 Hs.74583 

AI345001 HS.8238C 

BE391706 Hs.168073 

A1245026 Hs.111099 
AA070021 

AW452125 Hs.119273 

AI991887 Hs.305882 

H83265 Hs.8881 

M93718 Hs.166373 

ALC45019 Hs.323462 

AW249312 Hs.12109 

AJ270993 Hs.98428 

AA532963 Hs.9100 



Hpall thy fragments locus 9C 
KIAA1821 protein 
Target Exon 

nuclear factor of kappa light 
ESTs 

activated p21cdc42Hs kinase 

hypothetical protein FLJ20626 
hypothetical protein FLJ10851 
ESTs 

hypothetical protein FLJ22378 
NM_002858*:Homo sapiens ATP-bi 
hypothetical protein MGC11352 
malate dehydrogenase 2, NAD (m 
inositol 1,4,5-triphosphate re 
CGI-135 protein 
KIAA1698 protein 
gb:EST94ei6ActivaledT-cells 
ESTs 

divalent cation tolerant prote 
death-associated protein kinas 
cellular retinoic acid-binding 
exosome component Rrp41 
G protein-coupled receptor kin 
aspartyl aminopeptidase 
ESTs, Weakly similar to hypoth 
C9000306*:gi|12737280|ref|XP_0 
KIAA0276 gene product 
menage a trois 1 (CAK assembly 
DKFZP727M231 protein 
hypothetical protein MGC10974 
gb:zm67h03.r1 Stratagene neuro 
KIAA0296 gene product 
5-oxoprolinase (ATP-hydrolysin 
ESTs, Weakly similar to S41044 
nitric oxide synthase 3 (endot 
Homo sapiens cDNA FLJ11214 fis 
WD40 protein Ciao1 



hypothetical protein MGC2803 



AW893449 Hs.12303 

AW27S866 HS.19271S 

AA613S96 Hs.28412 

T78716 Hs.120446 

AWB38994 Hs.6363 
BE0610S3 

AL137326 HS.13J483 



SS,TM,SS,TM,Pep8dase_M22 

pkinase,pklnase 

ForK.head,SS,TM,glycolyti 

SS,SS,TI« 

SS,TM,DENN,Cylidylyltrans 
SS,zf-C2H2 
SS,MethylUansf_3 
TM 

SS,TM,SS 

ATP-synLF,SS,TM,CH,Rlani 
SS,TM,SS,TM,TSPN,tspL3,SE 
pkinase,SH2,SH3 



SCAN,rf-C2H2,KRAB,SS,KRAB 



ldh,ldlLC,adh_short,Seroia 
TM,RYDRJTPR,ion_trans,MI 
SS,™,PMP22.aaudln,20Gf 
SS,SS,TM,ayco_lwdroJ1,G 



aP_protease 

SS,TM,tiypsin 

SS,TM,pklnase,Activln_rec 
flavodoxin,FAD binding,NO 
DEAD,heiicase_C,dsnii,Vira 

homeobox,SS,homeobox,honie 

SS,TM,UM,homeobox 

RA,SH2,VPS9,SS,Tli4,Nucleos 

SS,TM,HC03jiotransp 

SS,DS 

S1,SH2,RibosoniaLl-23j)kin 

SS,TM^adheiin,cadhen"n 

Ets,SAM_PNT 



440745 
419344 
447208 
436163 



AL031186 Hs.289106 

M04768S Hs.62946 
AW20405B 

AW303627 Hs.143301 

U94905 Hs.277445 

BE316291 Hs.237971 
R84938 

AK001528 Hs.347285 

Ai2627B9 Hs.93669 

AW264102 Hs.S 



433472 
408928 
448093 



443443 
426577 
412482 



412338 
419395 
442462 



1117408 
AW975625 
AL161977 
AI073387 
AW372097 
AI802408 
Ai541246 



AW977382 

AW46C671 Hs.189284 

AW368a82 H5.33818 

AK001706 Hs.7778 

AA054400 Hs.161706 

AI344042 Ms.9347 

AW949856 Hs.97165 

AI499930 HS.33488S 

AW067800 HS.15S223 

AC003965 Hs.125532 

AA151527 Hs.69485 

BE268326 Hs.90280 
AF031406 

AW328081 Hs.6817 



BE394920 

BE615408 
Ai151353 
AW372721 
AI420016 
AI751281 
AA524093 



403945 
435593 
421899 



431934 
429499 
453485 



413053 
409983 
409936 
421592 

414788 
432805 
424927 
456863 
417823 

406621 



Human DNA sequence from clone 
ESTs 

transforming growth factor bet 
ESTs 

diacylglycerol kinase, zeta (1 
hypothetical protein MGC5627 
gb:yt65f04.r1 Soares retina 1^2 
Homo sapiens, Similar to DiGeo 
protein disulfide isomerase re 
ESTs 

DKFZP434P1750 protein 

,1/ouoo ESTs 
.2994 PCTAIRE protein kinase 3 

,133898 ESTs 

.278429 hepatocellular carcinoma-assoc 
ubiquitin A-52 residue ribosom 
.3343 phosphoglycerate dehydrogenase 
.255479 hypothetical protein MGC5566 
2,4-dienoyl CoA reductase 2, p 



S.7274 



BE208364 Hs.29283 

BE262524 Hs.93183 

X97058 Hs.16362 

Y15065 Hs.4975 

R88872 Hs.4964 

AJ011895 Hs.109281 

AI751768 Hs.155314 
AA324687 

AA022838 Hs.6S70 

AB031481 Hs.272214 

AA453809 Hs.99350 

BE620712 Hs.33026 

BE409283 Hs.193264 

AI242431 Hs.1 18282 

BE301871 Hs.4867 

AWS63263 Hs.65377 

D50922 Hs.57729 

AK001691 Hs.57656 

AF009801 Hs.105941 

AA577466 Hs.143696 



X78342 
X94630 
AW973666 I 



AI791493 



RecQ protein-like 5 
hypothetical protein FLJ10751 
KIAA0134 gene product 
regulator of G-protein signall 
ESTs 

mitochondrial GTP binding prot 



gb:AF031405 Soares fetal liver 
inosine triphosphatase (nucleo 
Homo sapiens, clone IMAGE-^59 



S.29742 Homo. 



hypothetical protein from EURO 



pyrimidinergic receptor P2Y, G 
potassium voltage-gated channe 
Target Exon 
DKFZP686J1624 protein 



BE391579 Hs.75087 

BE409917 Hs.266935 

W49S29 Hs.296200 

T05387 Hs.7991 

AAB12424 Hs.76067 

AL137589 Hs.152149 

X55039 Hs.85004 

X77343 Hs.334334 

□45371 Hs.80485 
AW861123 

AW372170 HS.1B3918 

AA157857 HS.1B2265 

AW151962 Hs.46679 



S8.SS,nmif-RanBP,nni,GA 
88,TM,8SF,FG-GAP,w»a,inte 
ank,DAGKa.lMGKe,DAG_PE-bi 



SS,TM,fn3,FKBP,TPR 

2-Hacid_DH,2-Hxid_DH_C,M 

A_deaniina3e.A_deaniinase 



SS,DEAD,hetee_C,88,DEA 
SS,TM,7tm_1 

helicase_C,PRK,88,TM,7lni. 
TM.NaJ'Lcotrans 



KIAA0095 gene product 
gb:E8T275S8 Cerebellum II Homo 
ESTs, Weakly similar to S10889 
STG protein 
ESTs 

hypothetical protein PP2447 
hypothetical protein MGC3234 
ENSP00000239138*:Guanine nucle 
PAP-1 binding protein 
mannosyl (alpha-1,3-)-glycopro 
ESTs, Moderately similar to Kl 
Kelch-like ECH-associated prot 
hypothetical protein FLJ10829 
bagpipe homeobox (Drosophila) 
coactivator-associated arginin 
cyclin-dependent kinase (CDC2- 
CD97 antigen 

hypothetical protein C321D2.4 
ESTs 

TSC-22-like 

immunoglobulin lambda locus 
ESTs, novel cytochrome P450 



trypsin,SS 
SS.TM,TIG,8ema,PSI 
AICARFrjMPCHas,MGS,AICAR 

Hani1pJike,SS 
SS,eniand 

WD40,TMAilivlnjBcp,pkIn 

coliliruADF,SS,TM 

SS,TM,lg,pklnas9 

SS,TM.arainotraiu1_2 

TM,pklnase 

SS,TM 

SS,TM,SS.TM 
SS,zf-CaH2 
SS,ABCJran 

SS,F6_F8_type_C.EGF,TGT 



Her|ies_HEPA,S8 

VinisJ1S.bZIP,G-gamma,Myo 

SS,TM 



SS,HLH,MyCLN_lemi,Myc-L2, 
TM,SS,TM,EF_TS.UBA,transm 
BTB,Kelch,SS,TM 
SS,TM 



SS,SNF2_N,hellcaseL.C,brDm 
pkinase 

SS.TM,7tm-2.GPS.EGF,SS,TM 



luston_gly,homeobox,TM 



hypothetical prolein AF053356_ 
ESTs 

heat shock 27kD protein 1 
hypothetical protein DKFZp434K 
centromere prolein B(80kD) 
transcription factor AP-2 alph 
adipose most abundant gene Ira 
gb:RC3-CT0297-120200-014-a05 C 
Homo sapiens CDNAFU12797 lis 
keratin 19 

hypothetksal protein FU20739 



CB^P-B,HTHj5 
TF_AP-2,TF_AP-2 
C1q,Collagen,SS 

SS,g,tspJ^US,SS,TM,Nuc 



WO 02/102235 
421453 
409616 
444744 
412575 



424919 
415528 
415137 
417334 



Z25821 
AL040S04 
AA400678 
AA234652 

AA076248 
BE394732 
Ml 13177 
AF038660 
BE260415 



BE083080 

NMJ05133 

BE314461 

H55052 

AI634834 

AA337572 

AA224483 

NM.002151 

AW410431 

M2130S 



426970 
434848 
468715 
436851 
425538 
444416 
426831 



452747 
444633 
434171 
422155 



449728 A1820761 

4S3245 T99801 

432238 AL133057 

430037 BE409649 

442196 AI9C2646 



U48263 

AI568350 

BE246446 

AA847843 

AW811064 

AJ 271671 

AI677994 

AA225065 



424619 BE387282 



431410 AW299534 



Hs.89466 Homo sapiens, Similar to dodec 
Hs,25063 PRO0461 protein 
Hs.6473 gb:zu70a11.r1 Soares_testi's_NH 
Hs.104555 neuropeptide FF-amide peptide 

gb:zm18c10.r1 Stratagene pancr 
Hs.147562 ESTs 

gb:zm29€05.s1 Stratagene pancr 
Hs.206713 UDP-Gal:betaGlcNAc beta 1,4- g 
Hs.348198 hypothetical protein FLJ20262 
Hs.326422 hypothetical protein MGC4549 
Hs.144054 ESTs 

gb:RC0-NN1012-270300-031-a10N 

ESTs, Moderately similar to AL 
Hs,274323 similar to sialyitransferase 7 
Hs.288626 RCE1, prenyl protein protease 
Hs.153768 U3sn(— - ' ' 



Hs.72451 Homo sapiens PACctoneRP5-1C8 

Hs.157240 tiypothettcalprotelaMG(»737 

Hs.27239 DKFZP586K0624piDleln 

Hs.823 hepsin (transmembrane protease 
Hs.283670 

Hs.20996 
Hs.107635 
Hs.339751 
Hs.274135 
Hs.227789 
Hs.31344 
Hs.1 55342 
Hs.24951 
Hs.1 83232 



SS,bZIP,zf-C2H2,bZIPaf-C 



Hs.61273 
Hs.16695 
Hs.62711 



ls.12135 

Hs!279869 
Hs.182698 
Hs.71574 
Hs.153961 
Hs.105739 
Hs.1 11577 
Hs.127762 



ribosomalprotei 
tiypottietical protein FU14926 
ARP1(actin-related protein 1, 



AK001500 
BE256304 
AK000973 
AA700946 
BE270918 

AW288085 Hs.1 1156 

BE296216 Hs.172673 

BE560e62 Hs.1 1417 

AW956781 Hs.293937 

H6452C Hs.18490 

W31096 Hs.237617 
AF029778 
AI493244 



hypothetical protein FU23436 
hypothetical protein FU 13852 
AD-015 protein 
hypothetical protein FU10111 
ESTs 

Homo sapiens, clone llvtAGE:3534 
hypothetcal protein 
S-adenosylhomocysteine hydrola 
Rab acceptor l(prenylaled) 
ESTs, Weakly similar to FXD2_H 
hypothetical protein FIJ2C452 
Homo sapiens, done IMAGE:3447 



HS.2395C0 hypothetical protein MGC13114 
Target Exon 

BE314877 Hs.24563 hypothetical protein FU12541 
BE50437C ESTs, Wealdy similar to CA13_H 

BE1S3855 HS.6146C Ig superfamHy receptor LNIR 
AP1 11713 Hs.286218 junctional adhesion molecule 1 
BE247688 Hs.347349 KIAA094a protein 
AW249152 
AI571225 
AF078037 
AW373011 
Ai217204 
AA531128 
AA443873 
AW401864 
BE072814 
AAS06101 



Hsi84171 KIAA1535 protein 

Hs.324051 RelA-associated inhibitor 

Hs.64558 hypothetical protein FU22222 

Hs.144968 ESTs 

Hs.115803 ESTs 

Hs.1 10477 doHdiyl-phosphateniannosyllra 

Hs.18720 programnied cell death 8 (apopt 

Hs.258519 ESTs, Moderately similar to 36 

Hs,285813 hypothetical protein FU1 1807 



Glyco transf_29 
Abi,SS,CPSase_Lj 
WD40.SS,KHHiomain 

SS,TM,ion_frans 
SS,TM,SS,TM 

trypsln.SS.TM.ATPIGIJ'LM. 



Homo sapiens mRNA; cONA DKFZp4 
milogen-aclivated protein kina 
hypothetical protein FtJ12586 
protein kinase C, delta 
ESTs 

hypothetical protein FLJ22638 
prepronociceptin 
hypothetical protein MGC2650 
ubiquitin-activating enzyme El 
High mobility group (nonhiston 
gb:MR2-ST0131-211099-008^;06 S 
zinc/iron regulated transporte 
fms-related tyrosine kinase 3 
Target CAT 

hypothetical protein FLJ20241 
Target Exon 

hypothetical protein DKFZp434A 
neuropathy target esterase 
hypothetical protein MGC14333 
hypothefical protein MGC10848 



SS,zf-C3HC4,SPRY,ztB_box 

0(*ids_neuropep,SS 

SS,ART,TM 

ThiF.UBACT 

SS,HMG_box 

SS 

Zip,SS,TM,Cytidylyltransf 



SS,pkinase,tJ^LRRCr,Ribo 



SS,P5CR,Epimerase,zf-C2H2 
SS.TM.SS,TM.LRRP,Peptlda 
IBR,2f-C3HC4,SS,TM,IRF,CK 

SS.SNF2J4.hel'icase_C,brom 

zHaHCM.SpoA,PHD,TM,syna 

AdoHcyase,SS 

S8.TM,llg_chan^NF_recept 

SS,PWWP,TSC22 

S8,TM 

SS 

DSL,EGF,vwc,granulin,SS,T 



SIRiHLH,Myc_N_tenii,Myc-L 
88,TMfll*IPJInding,lon.lr 
8H3,ank,8S,TU,HHH.ig 
418963 


BE304571 


Hs.89529 


aldo-keto reductase MIyl, 


aldoJteLred 


4.0 


437340 


AL353935 


Hs.135917 


hypothetical protein DKFZp7eiD 


TBC,bZIP,WD40.WD40 


4.0 


455928 


BE170313 




gb:QV4-HT0536^0500-193^ 02 H 


SS 


4.0 


400607 






Target Exon 


S8,honieabox 


4.0 


424825 


AF207069 


Hs.153357 


procollagen-lysine, 2-oi(ogluta 


20G-FelL0)(y,9ycosulrans 


4.0 


438143 


BE500981 


Hs^69652 


ESTs 




4.0 


433173 


Z35093 


HsJ196 




SURF1,SS,TM.SURF1,8URF4 


4.0 


412550 


R52452 


Hs.26370 


gb:yg80g07,r1 Scares Infant l)r 




4.0 



TABLE 23B: 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 



410446 
411219 
411674 

412091 
412173 
412575 
413534 
413564 
413764 
413837 



414413, 
415126 
419120 



BE61 4290 AA307674 N36629 AA338538 A;193603 AA781 096 A!680OS1 AI61 3268 AW276647 BE221263 AI348910 AI98S031 AI090078 AI35961 7 
AA666391 AI160210 AI445461 AI355345 A!343638AI34 3 640AI275091 M78745AW252795AW250002AA503756AI934519AW272086 N26520 
AA626639 

BE141732 U76823 BE141331 AW178416 AW178430 BE141343 BE141296 BE141702 BE141285 

AA070021 AA1 26205 AA082771 AA102169 AA0B3530 AA0821 83 AA115915 AA085147 AA125905 AA063336 AA079206 

AA706639 AA064707 AL036920 A1661 598 

AA076248 AA1 20958 AA122152 AA076249 

BE261 944 AA71 5461 AA318136 AA134972 AA319849 W04622 AI291665 AW879092 M130778 BE314003 AA9C8246AW96080B AA385346 
AA205977 C02043 AA135057 AA078e70 AA377395 AA318795AA3187e5 AA319160 BE000960 AW370250 AW370244 T85930 AA759250 
AI567682 AA932839 AI055920 
AA199830 AI143895 AVV9B1629 AA322482 
AW832917AW332913AW832906AW832788AWa32915AW832776 

AW861 123 AW861 1 25 AW866717 AW861 1 16 AW866706 AW856788 AW866774 AW866787 AW856780 AW856782 AW856789 AW856772 
AW856784 AW8S6786 AW8S6776 AW8S663S AW8S6767 
R061 85 AW891805 AWg01892 AW90189S 



433933 
433941 
434303 
434743 



439246 

440317 
442462 
442472 



16270_1 

120374 2 
I236056_1 
1253746.1 

1276564.1 
1280870.1 
I30769_1 



I387163J BE162704 BE162705 BE162732 BE162702 BE162694 

139363.1 AW163525AW163255AW163386AI929359BE279279AA132690AW157329AA584408AW157262AI69219BAW003614T24436AI765658 
AW1S7459AI810740AI659582AI969924AI929284AI34C993AI349083AW299522AW664650AW299613AA132529AI340991 AI912e36 
AI341293AI650609AA279 



286920.1 
3081 53_1 
313927 1 
333110.1 
363095_1 

377883J 
383224_1 
3925.1 



411522_1 
41515 6 
43473 1 
467686J 
47021_3 

49187_1 
543232_1 
543371_1 
64558 1 



D60945 061346 D81S68D80S39 

BE271922 N54771AA234233AA471354BE171081AA253482AA470113AA824327 H24470AW504757 N51688AI400700AA578548 AA714130 
AA609917 AW780349 AW664465 AW467553 AW571643 AA469943 AW474826 AA767165 AA326817 AA693859 AW952245 AW341 739 AA805093 
AA779455AW016655 

AW2491 52 AW2491 53 BE298958 AW192872 AF095714 R05553 AF083107 AF160214 NML012237 BE258447 BE253088 AA297721 H68948 
W391 53 AA070372 HI 4246 AL079367 R24561 AW403997 AA297034 AA297092 F1 1 858 AI3726S7 AA297787 Z42780 AA297072 T81 280 T33544 

AA297053 H26063AA26 

AA324687 AA3251 55 AW962038 

AA364923 AW963483 BE182774 C21461 

AA381 642 AA381 664 AW963560 AW949848 AA381728 AA381608 

AI287878 AI804160 AA400787 

AW879141 AA421182 AI734104 AI733923 AA430600 

M79091 AA773950 AA586573 AA457225 



AA583232AA601715 

AI754389 AW295190 AI056CS8 AI056059 AI863364 AI8633SS AW131720 AI674922 AI949042 Aigg0060 AI6231 78 AW469497 AA62C354 

AA620612AA994983AA994990 

AW204058 AI424379 AI668663 AA629077 AW613C33 

AI363410 AI356019 HC0141 n8748 AL049365 AL079911 AI750972 Z42602 AW452523 AI223826 AA215407 AI633829 AA292122 N42783 
AW505595 AF086096 N90340 N63271 AA131836 AW607273 AA527132 T32315 AA421961 T34951 AW966080 M78807 M31947 AA521 151 

AA278866AA044784AA700 

AA812046AW974514AA764999AA649302 

AA700946AA702712AA947620 

R84938 AL047151 AA310309 AW063200 AI569528 AI307823 N49975 

ALII 7488 AL044479 
AW276611 AA829C50 AA829190 

AI498072 AW251 083 AA985226 AA852987 AI392B09 AA206609 AW190187 AA555262 AF086057 F35814 AW516382 AA377885 N50847 F27148 

AA731186AA417728AI003145 
BE561888 BE560615 BE562102 
AF031405H73416 



447128 
448241 



BE246743 AA436942 AW024744 AW242177 AA975476 AW385185 R07536 R73462 AV654629 T57442 AI399986 R50073 R48743 A1769689 
AI863006 AA3178C6 AI678000 AW189963 AI986207 AW471273 R73463 AI335104 AI5901 61 AI469267 AI954604 H21 964 T251 41 AA856793 
R50074A1708253AI2 

AK001822 AW860325 AA335296 AW965531 AW13C957 AW193951 AI347975 AW081323 AW662627 AI343924 AI380749 AA938153 T66966 
AI655000 AW41 8837 AI380485 AMI 0698 AI520726 BE501 355 AI637925 AW77920O AI524755 AW593995 AI336927 AI336928 AI357036 R60592 
H19058R11124T1 

AI271898 BE048SC2 AI452509 AI244810 X84721 AI858001 AISS3g37 AA1 49853 H0071 9 AI7652S9 AW973696 F25787 F35749 AI56881 5 

AW016380AA554539 C00201 AA961610AW059537 R77127 

AW811C64AW811160AI478413 

AI471630 BE540637 BE265481 AW407710 BES13882 BES46739 AA0535g7 BE140503 BE218514 AWg56702 AI6S6234 AI636283 AIS67266 
AW34C858 BE207794 AA053085 R69173 AA292343 AA454908 AA293504 AI659741 AI927478 AA399460 AI760441 AA34641 6 BE047245 
AA730380AA394083AA454 
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967633_1 BE299996 BE297115BE270415BE295214BE296626 

1228976_1 AWB16029AW813292AW816156AW813333AW816159AW813302AW813344AW813172 

1249752 1 AW851734AW851676AW851693AW851713AW851722AW851616AW851731AW851618AW851648AW852215 



BEOeiOSS BE008gS9 BE0089S7 BE091618 

BE153524BE163576BE153583 

BE170313BE168339BE158290 

AI279811 AI301071 AI214696AI279813AA588460 AA287256BE171665 
AW27227g AA461542 AA4e0615 
AA776638 BE439540 

AI802408 AA907424 Al 279233 AI302762 N33153 BEQ45678 AI863332 AW173558 AI302328 Z20793 D25594 BE326823 

BE504370 AI243453 AI809656 AI702873 AI702163 AI300626 AW072219 AI369492 AI349587 AW779061 W78149 AA055693 AA974162 AI394380 

AI830098 AW054857 AI870008 AW207658 AW665508 AW300595 AI192992 AW62801 9 AI274366 AA906922 N92647 AW054727 AW2066S7 

AW136707AW13761 

ALJD45934AL039532 H55631 



Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of 
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495 
Strand: Indicates DNA strand from which exons were predicted 
Nt_position: Indicates nucleotide positions of predicted exons 



401210 
401264 
401278 
401609 
401674 
401724 
402197 
402365 



402916 
403128 



404333 
404343 
404344 
404365 
404661 
404757 
404807 
405334 



406230 
406244 
406301 
406437 



9454515 Minus 



8748904 
7331426 
9838273 
7711795 



7717156 
4760409 
7417725 
8676868 
7711306 
7711328 



35559-35295 

3112-4159 

187599-188138 

34428-34512 

94S18-946S9 



166969-167133,169760-169877,171563-171733 



70928-71185 

19813-20084,20153-20263 

80123-80322 

101165-101419 



361-474,541-687 
109532-110225 

122884-123018,123134-123283,123372-123695,123779-123940,124059-124256 

102163-102345,102545-102725 

48636-48822 

32141-32263 

65247-67529,112537-114863 

137948-138024,138111-138300 

122664-122931 

127865-128384 

50151-50319,50859-51098 

33374-33S7S,3376&^0C8 

100933-101083,101580-101782 



139386-139856 

101982-102171 

48325-48491,4913649262 

47667-47765,48461-48596 

161628-161734,162823-163014,164439-164652 

2923-3209 

71715-72515 

39422-39595 

57291-57494 



Table 24A lists about 117 genes down-regulated in ovarian cancer compared to non-malignant adult ovaries. These were selected as for Table 23A, except that the numerator 
was set to the 75th percentile amongst various non-malignant ovary specimens, the denominator was set to the 96th percentile value amongst various ovarian cancers, the 
numerator was greater than or equal to 75 units, and the ratio was greater than or equal to 2.0 (i.e., 2-fold downregulation in tumor vs. normal ovaries). 
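
The selection rule described for Table 24A can be illustrated with a minimal sketch. The function names, the linear-interpolation percentile, and the handling of a non-positive denominator are assumptions made for illustration only; the text specifies just the percentiles (75th for normal, 96th for tumor) and the thresholds (75 units, ratio of 2.0).

```python
def percentile(values, pct):
    """Percentile by linear interpolation between closest ranks (an assumed method)."""
    xs = sorted(values)
    if not xs:
        raise ValueError("percentile of an empty sample is undefined")
    rank = (len(xs) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(xs) - 1)
    return xs[lo] + (xs[hi] - xs[lo]) * (rank - lo)


def downregulation_ratio(normal, tumor, num_pct=75.0, den_pct=96.0):
    """Return (numerator, ratio) for one probeset.

    numerator   = 75th percentile of the non-malignant ovary values
    denominator = 96th percentile of the ovarian cancer values
    """
    numerator = percentile(normal, num_pct)
    denominator = percentile(tumor, den_pct)
    if denominator <= 0:
        # Assumption: treat a non-positive denominator as an unbounded ratio.
        return numerator, float("inf")
    return numerator, numerator / denominator


def is_downregulated(normal, tumor, min_units=75.0, min_ratio=2.0):
    """Apply the two thresholds stated in the text: numerator >= 75 units, ratio >= 2.0."""
    numerator, ratio = downregulation_ratio(normal, tumor)
    return numerator >= min_units and ratio >= min_ratio
```

For a probeset with hypothetical normal values 100 to 500 and tumor values 10 to 50, the numerator is 400 units and the ratio is well above 2.0, so the probeset would pass both thresholds.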

TABLE 24A: 

Pkey: Unique Eos probeset identifier number 

Ex. Accn: Exemplar Accession number, Genbank accession number 

UGID: UniGene number 

Title: UniGene gene title 

Protein Dom.: Predicted protein domain 

R1: Ratio of normal ovaries to tumor 



444931 

451573 

429570 ■ 

463610 

410295 

438549 

407S69 

414541 

448438 

441422 

413391 



436772 
415162 
427794 
433072 
418318 
410069 
431933 
420303 
438780 



.418444 
453767 
413624 
413306 
414604 



424319 
420174 
421709 
417622 



441916 
422746 
416777 
409403 



452123 
453305 
416157 



410209 
449500 
447806 
441712 
445025 
444161 



Ex. Accn UGID 
BE2724S2 Hs.183109 
A1732637 Hs.277901 



BE386801 Hs.21858 
AA046217 Hs.105370 



R43777 
AI223328 
Z39586 



Hs.76392 
Hs.24654 
Hs.21364 
Hs.75335 
Hs.27865 



AA437061 Hs.14060 
AW975688 

AF035718 Hs.78061 



U47732 Hs.84072 

NM 007038 Hs.68324 

AI187067 Hs.132554 

AA258282 Hs.278436 
M64936 

AA410292 Hs.104761 

AWg03297 Hs.236438 

AW970960 Hs.293821 

AI902899. Hs.85155 

AB011792 HS.3S094 

8E177019 Hs.75445 

NM_000426 Hs.323511 

AW069181 Hs.1 16175 

NM_015310 Hs.6763 

NM 012082 Hs.106309 

U54617 Hs.8364 

M36564 Hs.64016 . 

AA883929 Hs.40527 

AW205507 Hs.263499 

AI213950 Hs.1 25461 

AI373949 Hs.279610 

AW961026 Hs.96762 

AI824144 Hs.199749 

AA159394 Hs.107066 

AW298163 H3.82318 

AW960427 Hs.342874 
AI9a9712 

AW779544 Hs.1 15497 

AW978107 Hs.6884 

Hs.46783 

5 Hs.81182 



AA993571 
NM 004484 
AF146760 



AW408163 Hs.58488 

AI452601 Hs.288869 

AL023754 Hs.1 99068 

AI267615 Hs.33022 

R39224 Hs.257997 

NM_003243 Hs.342874 

U14965 Hs.180946 

AA349211 Hs.76205 

NMJ16651 Hs.48950 

AI694143 Hs.326248 

AI064347 Hs.2017 

AB029015 Hs.54886 
AI583661 



Hs.12 



ls.10432 



W03616 

AW391927 Hs.7946 

AI768895 Hs.295727 

N52543 Hs.142940 

BE621719 Hs.173802 

AI160015 Hs.125489 

N33937 Hs.10336 

AI268097 Hs.67317 

NM.004010 Hs.169470 

AA348031 Hs,7913 



general transcription factor I 
ESTs 

KIAA0022 gene product 



Homo sapiens cDNA FU 1 1 640 lis 



alcohol dehydrogenase 5 (class 
prokineticin 1 precursor 
metallothionein 1E (functional 

transcription factor 21 



hypothetical protein DKFZp761K 
ESTs 

butyrate response factor 1 (EG 



SPARC-like 1 (mast9, hevin) 

Homo sapiens cDNA: FU23176 « 

steriMphamolifandleucin 

KIAA0942 protein 

Friend of GATA2 

Homo sapiens pyruvate dehydrog 

proteins (alpha) 



hypothetical protein FU11539 
hypothetical protein FLI10493 
ESTs, Weakly similar to ALU8_H 
ESTs 

CED-6 protein 

WAS protein family, member 3 
transforming growth factor, be 
phosphatidylinositol transfer 
C11000244:gi|11056030|ref|NP_0 
hypothetical protein FU22555 
Homo sapiens mRNA; cDNA DKFZp6 
■inscDIJA:FU22382fi 

Target Exon 
ESTs 
glypican 3 

DKFZP664M1 41 6 protein 
Homo sapiens cDNA: FU22547fi 



similar to calcium/calmodulin 



hypothetical protein PRO1635 



ESTs, WfeaWy similar to AliJB_H 



88,EGF,ldUecepLb,thyro 
SS,serpin,SS,WD40,FYVE 
SS,Pep_M12B_propep,Reprol 
aldedh 



adhjiiic,HCV_NS4a,TM,adh_ 



trans membrane4 

Fteprolysiii,tspJ,Pep_M12B 

Tlvl,SS,TM 



SS.Pep_M12B_prapep.Reprol 



SS,TM,zonajellucida 

SS.PX.PH,PLDc.PHPLDo;PX 

SS,TM.SS,TFllDJOkD 



C2,PH,PI-PLC-Y,PI-PLC-X 

SS,TI*l,Forl(_head 

SS,TM 



ZZ,CH,WW,speclrin,bZIP,SS 
420728 
404245 
436420 
410066 
414476 
424137 



428642 
419677 
450435 
450696 
421255 
432467 
408664 
412611 

424666 
458147 
447566 
414496 



AA767718 Hs.93581 



AA301867 Hs.76224 

AA335769 Hs.16262 

AA017472 Hs.107260 

AI209158 Hs.143929 

TS4907 Hs.188672 

AI684710 Hs.201645 
AI493206 

AL038924 Hs.279849 

A1654048 Hs.1 96556 

NM 014899 Hs.10432 

L36531 Hs.91296 

AI595975 Hs.201805 

AI554223 Hs.16026 

BE326214 Hs.93813 

T03667 Hs.239388 

BE018882 Hs.46721 
AA732036 
AW295374 
AW353576 
AW752597 

N50432 Hs.102648 
W73853 

AW119112 Hs,9062 

NM.003652 Hs.78068 

AW95729S Hs.112318 

AL133047 Hs^4715 



SS,AWD 

SS,TM,Sema,PSI,ig 
Sbi1nogen_C,ti3.SS 
SS,TM,PMP22_aaudin,SS,TM 



.164478 
.31412 
.139861 



hypothetical protein DKFZpSeeH 



hypothetical ( 
ESTg 

Human DMA 
UCC1 protein 
hypothetical protein FU21939 

myopodin 
caveolin 2 

gb:IL3-.CT0214-161299-045«6 C 
ESTs 

Homo sapiens cDNA: FU22042 fi 



SS,TM 

PHpCH^peotrin 
SS,ATP-synLC 

zf-C3HC4,myosin_head,DIL, 
SS,MACPF.sushl.ldl_reoept 

TM,l'ntegi(nJ\,FG.GAP 



SS,Ependymln,SS 
SS,TM,Caveolin,Caveolin 



SS,TM,pkinase,F5_F8 type 
SS,TM 

Ztu:arbOpepl,Fz,Dioxygena 



TABLE 24B: 
Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 



410295 11922.2 AA741357AIB70000 W75997 H50726AV65a709 AI498817ALD37B04W67847 BE01B553AI033256 N76810 N31548A1032084 N36278 A«V075272 
AI032081 R35753 W93372AA700790AI9I)3697 N52985 R82468AWS80252AL(13676()AI052219 R36621VV07047AA088621AI249109WS8776 
W69374AA15 

414496 145392_1 W73853AA928112 W77887AW889237 AA148524AI749182AI7S4442AI338392AI2S3102AI07g403AI370541 AI697341 H97538 AW188021 

AI927669 W72716AI051402AI188071 AI335900 N21488AW770478 W92522AI691028AI913612 AI144448 W73819AA604358 N28900 W95221 
AI868132 H98 

418919 180623L1 AA232B35 AI373703AA233330 

418956 180862.1 AA234831 AI700302 AA906216 AA776957 R49415 AI420777 AA666394 AI8306ig AA779469 AI972390 1440980 AI094453 AA826397 AAS35g94 

AI868257 AI80429S AA897791 AA232893 AI348680 AI3S6232 AA235138 F31 396 AW079977 HI 6405 
424651 241981_1 AI493206 AA732315 AA344619 AA904035 AW952967 AA488889 AA635644 BE245127 AA669979 AA761B74 H28767 AA910081 AA837086 

AA766495 W76175 AIS21825 AA746092 AA7431S2 AI47B562 H88863 
436772 426854 1 AW975688 AA731063 N67084 

438780 46S01.1 M64936A1C25512AI382987BEC61777AA089966BE169930T41176AW594624BE502415AA121893AI269283 T40311 A1684569AA257011 

AI079277 AI241318 BE327710 AW97S21S AW89626B AA884ggO BE327S14 
441916 528799.1 AA993571 AA971518AI937262 

444931 62567.1 AV652066AA459880 T58512 T5B561 AI651255IW9838 H87921 AVV264447AA428C67AA364094AW955685ID62894AW341452AA243652 
AI984618 AI816S(»AI871252AI376942AI7404g6AA452836 AI277917AI149141 AA4S6147AI7B4S66 AI003975 AI24S674 AI433703 AI2a0208 

AI268985AI38 

451573 875588 1 AW130351 AW338699 A1803973 

458147 488021_1 AW762597AWB48781 AW849062AW848490AW752699AW752604AW752700 
TABLE 24C: 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of 
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495 
Strand: Indicates DNA strand from which exons were predicted 
Nt_position: Indicates nucleotide positions of predicted exons 

Pkey Ref Strand Nt_position 

400829 8570385 Plus 152176-152516 
403247 7656833 Minus 76626-77140 
404245 7406725 Plus 36019-36282,37073-37813,38946-39314,40355-40651,42738-43028,43391-43696,45698-46030,51110-51415,52779-53072,54648-54935,55201-55509,55926-56240,56355-56672,57078-57401,59966-60262,62600-62926,63363-63686,66693-67025,68180-68497,68909-69232,71372-71695,720 
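
As an illustration of how the Nt_position column can be read, the following sketch parses a comma-separated list of exon ranges and computes exon lengths. It assumes coordinates are 1-based and inclusive, and that a range written with the larger number first is still one exon; neither point is stated in the table legend. The function names are hypothetical.

```python
def parse_nt_position(field):
    """Parse an Nt_position entry such as '361-474,541-687' into (start, end) pairs.

    A truncated range (for example a trailing '54648-') raises ValueError
    rather than being silently completed.
    """
    exons = []
    for part in field.split(","):
        part = part.strip()
        if not part:
            continue
        start_text, end_text = part.split("-")
        exons.append((int(start_text), int(end_text)))
    return exons


def exon_lengths(exons):
    """Length of each exon, assuming inclusive coordinates in either order."""
    return [abs(end - start) + 1 for start, end in exons]
```

Under these assumptions, the entry `152176-152516` describes a single predicted exon of 341 nucleotides, and `361-474,541-687` describes two exons of 114 and 147 nucleotides.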



Table 25A provides UniGene ID, UniGene Title, Pkey, and Exemplar Accession for sequences in Table 26. The information in Table 25A is linked by SEQ ID NO: to Table 26. 
Ex. Accn: Exemplar Accession number, Genbank accession number 
UGID: UniGene number 
Title: UniGene title 

SEQ ID NO: Sequence identification number for sequences in Table 26 
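
Because Table 25A is linked to Table 26 through the SEQ ID NO column, a lookup from sequence number to Pkey can be sketched as follows. This is only an illustration: the function names and the tolerant pattern (which accepts `SEQID`, `;` in place of `:`, and stray spaces, as seen in scanned copies) are assumptions, not part of the original tables.

```python
import re

# Accepts forms such as 'SEQ ID NO: 13-22', 'SEQID NO; 81', 'SEQ ID NO: 141 -144'.
_SEQ_ID_PATTERN = re.compile(
    r"SEQ\s*ID\s*NO\s*[:;]\s*(\d+)(?:\s*-\s*(\d+))?", re.IGNORECASE
)


def parse_seq_id_range(text):
    """Return the list of sequence numbers named by one SEQ ID NO entry."""
    match = _SEQ_ID_PATTERN.search(text)
    if match is None:
        raise ValueError(f"no SEQ ID NO found in {text!r}")
    first = int(match.group(1))
    last = int(match.group(2)) if match.group(2) else first
    if last < first:
        raise ValueError(f"descending SEQ ID NO range in {text!r}")
    return list(range(first, last + 1))


def index_by_seq_id(rows):
    """Map each sequence number to the Pkey of the Table 25A row that lists it.

    rows: iterable of (pkey, seq_id_field) pairs.
    """
    index = {}
    for pkey, seq_field in rows:
        for seq_id in parse_seq_id_range(seq_field):
            index[seq_id] = pkey
    return index
```

With the rows for Pkey 428579 (`SEQ ID NO: 13-22`) and Pkey 428227 (`SEQ ID NO: 23-24`), sequence 13 resolves to 428579 and sequence 24 to 428227. Raising an error on a descending range is a deliberate choice, since such an entry more likely reflects a scanning error than a real range.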



Pkey 




UGID 


Title 


SEQ ID NO 


452838 


U65011 


HsJ(J743 


preferentially expressed antigen in mela 


SEQ ID NO: 1-2 


421478 


AI683243 


HS57258 


ESTs, Moderately similar to S29539 ribos 


SEQ ID NO: 3-4 


436982 


AB018305 


Hs.5378 


spondin 1 , (f-spondin) extracellular mat 


SEQ ID NO: 5-6 


468627 


AW088642 


Hs.97934 


SRY (sex determining region Y)-box 17 (S 


SEQ ID NO: 7-8 






Hs.1 22579 


ECT2 protein (Epithelial cell transformi 


SEQ ID NO: 9-10 


410102 


AW248608 


Hs.279727 


ESTs; tiomologue of PEUM pona savignyi 


SEQ ID NO: 11-12 


428579 


MM 006756 


Hs.1 84942 


G protein-coupled receptor 64 


SEQ ID NO: 13-22 


428227 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


SEQ ID NO: 23-24 


451110 


AI966040 


Hs.265398 


PAR-6 beta (partitioning defective 6 h 


SEQ ID NO: 25-26 


428187 


AI687303 


Hs.285529 


G protein-coupled receptor 49 


SEQ ID NO: 27-28 


424905 


m 002497 




NIMA (never in mitosis gene a)-related k 


SEQ ID NO: 29-30 


433159 


AB036898 


H3.150587 


kinesin-like protein 2 
TTK protein kinase 


SEQ ID NO: 31-32 


426427 


M86699 


Hs.169840 


SEQ ID NO: 33-34 


426371 


D4S441 


Hs.1 55981 


mesothelin 


SEQ ID NO; 35-38 


418606 


AA084248 


H3.85339 


G protein-coupled receptor 39 


SEQ ID NO: 39-40 


456646 




Hs.203845 


KCNK15 potassium channel, subfamily K, m 


SEQ ID NO: 41-42 




NM_000869 


Hs.2142 


5-hydroxytryptamine (serotonin) receptor 


SEQ ID NO: 43-44 


445537 


AJ245671 


Hs.12844 


EGF-like-domain, multiple 6 


SEQ ID NO: 45-46 


424620 


AA101043 


Hs.151254 


Kallikrein 7 (chymotryptic, stratum corn 


SEQ ID NO: 47-48 


412078 


X6S69g 


Hs.73149 


paired box gene 8 


SEQ ID NO: 49-52 


409178 


BE393948 


HS.S0S1S 


kallikrein 5 


SEQ ID NO: 53-54 


448243 


AV\069771 
BE616633 




integrin, beta 8 


SEQID NO: 55-56 


426514 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


SEQ ID NO: 57-58 


419452 


U33635 


Hs.gOS72 


PTK7 protein tyrosine kinase 7 


SEQ ID NO: 59-60 


431130 


NM_006103 


Hs.2719 


HE4; epididymis-specific, whey-acidic pr 


SEQID NO: 61-62 


416639 


AI733881 


Hs.72472 


BMP-RIB 


SEQID NO: 63-64 


423961 


D13666 


Hs.136348 


periostin (OSF-2os) 


SEQ ID NO: 65-66 


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


SEQ ID NO; 67-68 


422867 


L32137 


H3.1584 


cartilage oligomeric matrix protein (pse 


SEQ ID NO; 69-70 


409542 


AA503020 


Hs.36563 


hypothetical protein FLJ22418 


SEQ ID NO; 71-72 


444381 


BE387335 


Hs.283713 


ESTs, Weakly similar to S64054 hypolhetl 


SEQ ID NO; 73-74 


452747 


BE153855 


H3.61460 


Ig superfamily receptor LNIR 


SEQ ID NO; 75-76 


450375 


AA009647 




a disintegrin and metalloproteinase doma 


SEQ ID NO; 77-78 


426216 


AW963419 


Hs.166223 


stanniocalcin 2 


SEQ ID NO; 79-80 


430044 


AA464510 


HS.1S2812 


ESTs 


SEQID NO; 81 


447033 


AI357412 


Hs.157601 


ESTs 


SEQID NO; 82-67 


410418 


D31382 


Hs.63325 


transmembrane protease, serine 4 


SEQ ID NO; 83-89 


411274 


NM_002776 


Hs.69423 


kallikrein 10 


SEQ ID NO; 90-91 


422260 


AA315993 


HS.1CS484 


regenerating gene type IV 


SEQ ID NO; 92-93 


409041 


AB033026 


Hs.50081 


Hypothetical protein, XP 051860 (KIAA119 


SEQID NO; 94-96 


428664 


AK001666 


HS.18909S 


similar to SALL1 (sal (Drosophila)-like 


SEQ ID NO; 95-97 


404977 






insulin-like growth factor 2 (somatomedi 


SEQ ID NO; 98-99 


427747 


AW411425 


Hs.180655 


serine/threonine kinase 12 


SEQID NO: 100-101 


412140 


AA2196gi 


Hs.73625 


Wm interacting, kinesln-like (rabkines 


SEQ ID NO: 102-103 


431846 


BE0ig924 


HS.2715B0 


uroplakin 1B 


SEQ ID NO: 104-105 


425465 


L18964 


Hs.1904 


protein kinase C Iota 


SEQID NO: 106-107 


432938 


T27013 


Hs.3132 


steroidogenic acute regulatory protein 


SEQID NO: 108-109 


421451 


AA291377 


HS.S0831 


ESTs 


SEQID NO: 110-117 


437478 


AL390172 


Hs.317432 


branched chain aminotransferase 1, cytos 


SEQID NO: 118-119 


411945 


AL033S27 


Hs.92137 


L-myc-2 protein (MYCL2) 
paternally expressed 3 


SEQID NO: 120-121 


424078 


AB006626 


Hs.139033 


SEQID NO: 122-123 


408400 






kallikrein 8 (neuropsin/ovasin) (KLK8) 


SEQID NO: 124-125 


428450 


MM 014791 


Hs.1 84339 


KIAA01 75 gene product 


SEQID NO: 126-127 


438167 


R28363 


Hs.24286 


chemokine binding protein 2 (CCBP2), mRN 


SEQID NO: 128-129 


416530 


U62801 


Hs.79361 


kallikrein 6 (neurosin, zyme) 


SEQ ID NO: 130-131 


430591 


C14187 


Hs.1 57208 


aristaless-related homeobox protein ARX 


SEQID NO: 132-133 


408081 


AW461597 


Hs.1 67409 


intron of basic-helix-loop-helix-PAS pro 


SEQ ID NO: 134 


411773 


NM_006799 


Hs.72026 


protease, serine, 21 (testisin) 


SEQID NO: 135-136 


407792 


AI077715 


Hs.39384 


putative secreted ligand homologous to f 


SEQ ID NO: 139-140 


428093 


AW694506 


Hs.104830 


ESTs 


SEQ ID NO: 141 -144 


431630 


MM 002204 


Hs.265829 


integrin, alpha 3 (antigen CD49C, alpha 


SEQID NO: 145-148 


421502 


AF111866 


Hs.1 06039 


solute carrier family 34 (sodium phospha 


SEQID NO: 149-150 




U81961 


Hs.2794 


sodium channel, nonvoltage-gated 1 alpha 


SEQ ID NO: 151-152 


431369 


BE184456 


Hs.251754 


secretory leukocyte protease inhibitor ( 


SEQ ID NO: 153-154 


436972 




Hs.26640 


claudin 3 


SEQ ID NO: 155-156 


429504 


X99133 


Hs.204238 


lipocalin 2 (oncogene 24p3) (NGAL) 


SEQID NO: 157-158 




AB041036 


Hs.57771 




SEQ ID NO: 159-160 



TABLE 25B: 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 



Pkey CAT Number Accession 

448243 75629 1 AW369771 AW748174AA290801 AA419198AA044331 AA127909 AVV995442A1480343 AA(m582AW956159AA373451 AA127965 A^^ 
AW994956 BE622314 BE006298 BE0C6312 BE006305 BE006317 BE006303 AA043906 AA234175 AA479726 

450375 83327J AA009647AA131254AA374293AW954405 H0441CAW6062a4AA151166BE157467BE1576C1 H04384 W46291 AW663674 H04021 HC1532 
AA190993 HCQ231 H59605 H01642AA852876 AA113758 AA626915 AA74e952 AI161014 AA099554 R69067 
TABLE 25C: 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of 
human chromosome 22" Dunham, et al. (1999) Nature 402:489-495 

Nt_position: Indicates nucleotide positions of predicted exons 






r ACAGCTCCCC CGCRGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 
CGGGACACCC CACCCGCTTC CCRGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACA.!iGTGACT 
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT T 
25 ACGRAGGOST TTGTGGGGTT CCATTCAGAG CCGATACATC AGCATG; 

CCCACGGRGA CTTGSrGGAGC TGGCAGGGCA GfeCCTGCTG AAGGATGAGG C 

TGCOSCCCTG GAGTTGCTGC COMSGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTGA 42 0 

CGGGAGACAC AGCCAGACCC TGRRGGCAAT GGTGCAGGCC TGGCCCTTCA CCTGCCTCCC 4 80 

TCTGGGAGTG CTGATGAAGG GACAACRTCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 

30 TGGACTTGAT GTGCTCCTTG CCCAGGAGGT TCXSCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 

GGATTTACGG AAGAACTCTC ATCAGGACTT CTGGACTCTA TGGTCTGGAA ACAGGGCCAG 660 

TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACRAAGAAGC GAAAACTAGA 720 

TGGTTTGfiGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GftGGTGCTCG TAGACCTGTT 780 

CCTCAAGGAA GGTGOCTGTG ATGAATTGTT CTCCTACCrC ATTGAGAAAG TGAAGCGAAA 840 

35 GftftRAATGTA CTACQCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 

TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAfiGATTTGG AAGTGACTTG 960 

TACCTGGAAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TGATTAATCT 1020 

GCGTAfiACTC CTCCrCTCCC ACATCCATGC ATCTTCCTAC AITTCCCCGG AGAAGGAAGA 1080 

GC3«3TATATC GCCOVGTTCA CCTCTCAGTT CXrTCAGTCTG CXGTGCCTGC AGGCTCTCTA 1140 

40 TGTGGACTCr TTATTTTTCC TTAGAQGOCG CCTGGATCAG TTGCTCAGGC ACSTGATGAA 1200 

CCCCTTGGAA ACCCTCTCRA TAACTAACTG CCGGCTTTCG GAAGGGGATG TGATGCATCT 1260 

GTCCCAGAGT CCC3«3CGTCA GTCAGCTAAG TGTCCTGAGT CIAAGTGGGG TCATGCTGAC 1320 

CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTOTGCCA CXCTCCAGGA 1380 



CTTGCAGAGT CTCXTOCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 

TGICCCCCTG GAGAGTTATG AGGACATCC» TGGTACCCrC CaCCTGGAGA GGCTTGCXWA 1620 

TCTGCATGCC AGGCICAGGG AGTTGCTGTG TGAGTTGGGa OGGCCCAGCa TGGTCTGGCT 1680 

TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCX3G AGCCCATCCT 1740 

50 GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCaCATATC AftATGCTTCA TTCTGCATAC 1800 

TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA C3«AGCAGCC ACSGTTTCAG 1860 

ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAAC3iT TCSGACAAAT 1920 

GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGSCTTG AGGAGTTAAT 1980 

GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 

55 GATTCTGGCT TGCGRAGTAC ATGTAGGAGT TAATCCCTGT GrTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTCAAGC AAAAAAAAAA AAAAAAAA 

Protein Accession #: NP^OSlOS.l 

1 11 21 31 41 51 

I I I I I I 

MERRRLKGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 

FDGRHSQTLK AMVC3AWPFTC LPLGVLMKOQ HLKLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120 

VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEREQ PFIPVEVLVD 180 

LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDIKMILKMV QLDSIEDLEV 240 

TCTWKLPTLR KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300 

LYVDSLFFLR GRIiDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 

LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQUAbLP SLSHCSQLTT LSFYGNSISI 420 

SALQSLLQHL IGLSNLTHVL YPVPLESYBD IHGTLHLERIi AYLHARLREL LCELGRPSMV 480 



Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 264.. 782 

1 11 21 31 41 51 

I I I I I I 

OCCTGCTCCA GTCACACCCG GAAGCTGACT GGTCX3VCX3C3V CAGCTGABGC ATGAGGAAAC 
TCATCGCGGG ACTAATTTTC CTTAAAATTT AGACTTGCAC AfiTAAGGACT TCAACTGACC 
TTCCTCAGAC TGAGAACTGT TTCXXGTATA TACATCAAGI CACTGASATC TCCAGCACCC 
TGCCGGTGGC ACTACTGAGA GACGAGCTGC CAGGGTGGTT CCTGAAAGTG CCTGAGCCCC 
AACTTATOIG CAAGGAGCTC ATCaTGCTOA CAGAAGTCAT GGAGGTCTGG CATGGCTTAG 
TGATCGCGGT GGTGTCCCTC TTCCTGCfiGG CCTGCTTCCT CACCGCCSITC AACTACCTGC 
TCAGCAGGCA CATGGCCCAC AAGAGTGAAC AGATACTGAA AGCGGCCAGT CTCCAGGTTC 
CCAGGCCCAG CCCTGGCCAC CATCATCCAC CTGCTGTCAA AGAGATGAAG GAGACTCAGA 
CAGAGAGAGA C»TCCCAATG TCTGATTCCC TTTACAGGCA TGACAGCGAC ACACCCTCAG 
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ATAGCTTGGA TAGCTCCTGC AGTTCGCCTC CTGCCTGCCA GGCCRCAGAG GATGTGGATT 600 

ACACACAAGT CGTCTTTTCT GACCCTGGAG AACTAAAAAA TGACTCCCCG CTGGACTATG 660 

AGAACATAAA GGAAATCACA GATTATGTCA ATGTCAATCC AGAAAGACAC AAGCCCAGTT 720 

TCTGGTATTT TGTCAACCCT GCTCTGTCTG AGCCAGCGGA ATATGATCAA GTGGCCRTGT 780 

GAATTCCAAA TATTTTTAAT GGGGTCCAGT TCTCTATGGA TTCTTACATT TAATTTGTAG 840 

GGAftATGCCA TTTTTCCCCC TTAAACflAGG CATGGGGCTC ACAAGTCTAT GGRG&CAGGC 900 

CAAAAAG»AT GTGGAGAAGA AAACTGATAA ATACACAGAG GTCCTCAAGA CCCATGGACT 9 SO 

CCTGGTCTGT ACCOUUiAAA GCTGTTCGTT CCTCa^AAAAC AAAAACAAGG CTTGGCIGGG 1020 

AAAACAGGCC AATGCCCCQG CAAGAAAGGT TGAGATCAGA TGTTAGGAAG AACTTTCAGG lOBO 

k GAACTATGGA GTCCATCAGC AGAGATAGTA GTGRAGTCTC TCCCCAGGGA 1140 

A AAGSTTGAAT CAGCTGTTST AGAGTTCTAT TTGGCaiATCr C3VTGGTTAAA 1200 

TGACTTCCCT TTGAGCTCTT TAATTATTGG CAATAAACAA CTTCTTTAAA AGTTTTAAAT 1260 



I I I I I I 

MLTEVMEVWH GLVIAWSLF LQACFLTAIN YLLSRHMAHK SEQILKAASL QVERPSPGHH 
E TQTERDIPMS DSLYRHDSE)T PSDSLDSSCS SPPACQATED VDYTQWFSD 
D YVNVNPERHK PSFWYPVNPA LSEPAEYDQV AM 



AGCGGCCGCG GCRCAAAGTT GGGGGCCG03 AAGATGAGGC TGTCCCCGGC GCCCCIGAAG 



TCCGACGAGA CCCTGGACAA AGTGCCCAAG TCAGAGGGCT ACTGCAGCCG T 
GCCCAGGGCR 'CGCGGCGCGA GGGCTACACC GAGTTCAGCC T 
GACTTCTACA AGCOSGGAAC CAGCTACCGC GTAACACTTT CAGCTGCTCC T 
TTCAGAGGAT TCACATTAAT TGCCCTCAGA GAGAACAGAG A " 
CRTGCTGGGA CCTTCCAGAT CRTAGACGAA GAAGAAACTC AGTTTATGAG C 

GTTGCAGTCA CTGRAAGCAC TCCACGGAGG AGGACCXTCGA TCCAGGTGTT TTGGATAGCA 480 

CCACCAGCGG GAACAGGCTQ CGTGATTCIG AAGGCCAGCA TOGTACAAAA ACGCATTATT 540 

TATTTTCAAG ATGAGGGCTC TCTGACCAAG AAACTTTGTG AACAAGATTC CACATTTGAT 600 

GGGGTGACTG ACAAACCCAT CTTAGACTGC TGTGCCTGCG GAACTGCCAA GTACACACTC 660 

ACATTTTATG GGAATTGGTC CG&GAAGACA CACCCAAAGG ATTACCCTCO TCOGGCCAAC 720 

CRCTGGTCTG CGATCATCG6 AGGATCCCAC TCCAAGAATT ATGTACTGTG GGAATATGGA 780 
GGATATGCC» GOSASfiGCGT CaAACAAGIT GCAGAATTGG G 
GAAGAAATTC GACAACAGAG TGATGAGGTC CTCACCGTCA TCAA/.GCCAA A 
CXAGCCTGGC AGCXrrCTCAA CGTGAGAGCA GCACCITCAG C 

ACGOGCCATT TAATCTCXTTT CCTGACCATG ATGGGCKCTA GTCCCGACTG GAACGTAGGC 1020 

TTATCTGCAG AAGRTCTGTG CACCAAGGAA TGTGGCTGGG TCCSGAAGGT GGTGCAAGAC 1080 

CTGATTCCCr GGGAOSCrGG CACCGACaGC GGGGTGACCT ATGftGTCACC CAACAAACCC 1140 

ACCATTCCCX: AGGAGAAAAT CCGGCCCCTG ACCAGCCTGG ACCATCCTCA GAGTCCTTTC 1200 

TATGRCCCAG AGGGTCQGTC CATCACTCAA GTAGOCXGAG TTGTCATOGA GAfiAATCGCA 1260 

CGGARGGGTC AACAATGCSiA TATTGTACCT GACAATGTCG ATGATATTGT fifiCTQACCTG 1320 

GCTCCAGAflG AGAAAGATGA AGATGACXCX: CCTGAAACCT GCSWCTACTC CAACTGGTCC 1380 

CC*TGGTCa3 CCIGCAGCTC CTCCACCTGT GACAASGGOi AfiBGGATGO: ACAGCGCATG 1440 

CTGAAAGCAC AGCTGGACCT CAGCGTCCCC TGCCCIGACa CCCAGGRCTT CCAGCCCTGC 1500

ATGGGCCCTG GCTGCAGTGA OGAAGAaXSC TCCACCTGCa CC3iTGTCXX3A GTGGATCACC 1560 

TGGTCGCCCT GCAGCATCTC CTGCGGCATG GGCATGACGT CCCGGGAGAG GTATGTGAAG 1620 

CAGTTCCCGG AGGACGGCTC CGTGTGCAOG CTGCCCACTG AGGAAACGGA GAAGTGCACG 1680 

GTCAACGAGG AGTGCTCTCC CAGCAGCTGC CTGATGACCG AGTGGGGCXa^ GTGGGACGftfi 1740 

TGCAGCGCCA CCTGCGGCAT GGGCATGAAG AAGCGGCACC GC31TGATCAA GATGAACCCC 1800 

GCAGATGGCT CCATGTGCAA AGCCGAGACA TCAC3«3GCAG AfiAAGTGCAT GATGCCAGfiC 1860 

TGCCACACCR TCCCATGCTT GCTGTCCCCA TGGTCEGAGT GGRGTGRCTG CAGOSTGACC 1920 

TGCGGGAAGG GCATGCOAAC CCGACAGOSG ATGCTCAAGT CTCTGGCAGA ACTTGGAGAC 1980 

TGCAATGAGG ATCTGGAGCA GGTGGAGAAG TGCATCCTCC CTGAATGCCC CATTGACTGT 2040 

GAGCTCACCG AGTGGTCCCA GTGGTCGGAA TGTAACAAGT CATGTGGGAA AGGCCACGTG 2100 

ATTCGAACCC GGATGATCCA AATGGAGCCT CAGTTTGGAG GTGCACCCTG CCCAGAGACT 2160 

GTGCAGCGAA AAAAGTGCCG CATCCGAAAA TGCCTTCGAA ATCCATCCAT CCAAAAGCTA 2220

CGCTGGAGGG AGGCCCGAGA GAGCCGGCGG AGTGAGCAGC TGAAGGAAGA GTCTGAAGGG 2280 

GAGCAGTTCC CAGGTTGTAG GATGCGCCCA TGGACGGCCT GGTCAGAATG CACCAAACTG 2340 

TGCGGAGGTG GAATTCAGGA ACGTTACATG ACTGTAAAGA AGAGATTCAA AAGOTCCCAG 2400 

lAGACAA GAAGGAGATC AGAGCATGCA ATGTTCP.TCC TTGTTAGCAA 2460 
T TCCCCAGGGC TGCACTCTAG ATTCCAGAGT C 
A GACAATTTAA ATTGTGTACG CTAGTTT 



C TTCCTGCRGA GGAGTAGTGI CAGCCACCTT 
C CCTCTGGAGC TTCCACCTGG CCAGGGAGGA CGGAGACTTT 
GACCTACTCC ACATGGAGAG GCAACCATGT CTGGAAGTGA C 
GCGGCAGGTA GGAAACATTC A 

r GAAACCATTG TTTGCCCATC T 

C AGTCATCAAT AGTTCCTGGG GAAAAACSiGA GCTGGTAGAC T 

CCCTGATATT GGTTCCTGAT GCCCCCCCAA CAAAAATAAA TAAATAAATT ATGGCTGCTT 3120 

TATTTAAATA TAAGGTAGCT AGTTTTTACA CCTGAGATAA ATAATAAGCI TAGAGTGrTAT 3180 

TTTTCCCTTG CTTTTGGGGG TTCAGAGGAG TATGTACAAT TCTTCTGGGA AGCXXGCCTT 3240 

CTGAACTTTT TGGTACTAAA TCCTTATTGG AACCAAGACA AAGGRAGCAA AATTGGTCTC 3300 

TTTAGAGACC AATTTGCCTA AATTTTAAAA TCTTCCTACA CACATCTAGA OSTTCAAGTT 3360
\ TTTTATATGC AGAGRGRAAA AGTTACC(3fiG ACAGRAAACR AATCTAAGGG 3540

AASGGAATAT TATGGGATTA AGCTGAGCAA GCAATTCTGG TGGAAAGTOl AACCTGTCAG 3600 

TGCTCCRCAC CAGGGCTCTG GTCCTCCCRG ACATGCATAG GAATGGCCRC AGGTTTACAC 3660 

TGCCTTCCCA GCAATTATAA GCACACCAGA TTCAGGGAGA CTGACXaVCCR AGGGATAGTG 3720 

TAAAAGGACA TTTTCTCAGT TGQQTCCATC AGCaWSTTTTT CTTCCTGCRT TTATTGTTGA 3780 

AAACTATTGT TTCATTTCTT CTTTTATAGG CX:TTATTACT GCTTAATCCR AATGTGTACC 3840 

ATTGGTGAGA CACATACAAT GCTCTGAATA CACTACGAAT TTGTATTAAA CACATC3«3AA 3900 

TATTTCCAAA TACAACATAG TATAGTCCTG AATATGIACT TTTAACACAA GAGAGACTAT 3960 

TCAATAAAAA CTOiCTGGGT CTTTCATGTC TTTAAGCTAA GTAAGTGTTC AGAAGGITCT 4020 

TTTTTATATT GTCCTCCACC TCCATCATTT TCAATAAAAG ATAGGGCTTT TGCTCCTTTG 4080 

TTCTTGGAGG GACCATTATT ACATCTCTGA ACIACCTTTG TATCCAACAT GTTTTAAATC 4140 

CTTAAATGAA TTGCTTTCTC CCSAAAAAAG CACAATATAA AGAAACACAA GATTTAATTA 4200 

TTTTTCTACT TGGGGG6AAA AAAGTCCTCA TGTAGAAGCA CXICACTTTTG CAATGTTGTT 4260 

CTAAGCTATC TATCTAACTC TCAGCCCATG ATAAAGTTCC TTAAGCTGGT GATTCOTAAT 4320 

CAAGGACAAG CCACCCTAGT GTCTCATGTT TGIATTTGGT CCCAGTTGGG TACATTTTAA 4380 

AATCCTGATT TTGGAGACTT fiAAACCAGGT TAATGGCTAA GAATGGGTAA CRTGACTCTT 4440 

GTTGGATTGT TATTTTTTGT TTGCAATGQG GAATTTATAA GAAGCATCSVA CTCTCTTTCT 4500 

TACCAAAGTC TTGTTAGGTG GTTTATAGTT CTTTTGGCTA ACAAATCATT TTGGAAATAA 4560 



I I I I I I

MHLSPAPLKL SRTPALLALA LPLAAALAFS DETLDKVPKS EGiTCSRILRA QGTRREGYTE 60 

FSLRVEGDPD FYKPGTSYRV TLSAAPPSYF RGFTLIALRE NREGDKEEDH AGTFQIIDEE 120

:STPRRR TRIQVFWIAP PAGTGCVILK ASIVQKRIIY FQDEGSLTKK 180 

3 YASEGVKQVA ELGSPVKMEE EIRQQSDEVL TVIKAKAQWP AWQPLNVRAA 300 

PSAEFSVDRT RHLMSFLTMM GPSPDWNVGL SAEDLCTKEC GWVQKWQDL IPWDAGTDSG 360 

VTYESPNKPT IPQEKIRPLT SLDHPQSPFY DPEGGSITQV ARVVIERIAR KGEQCNIVPD 420

NVDDIVMLA PEEKDEDDTP ETCIYSNWSP HSACSSSTCD KGKRMRQRML KAQLDLSVPC 480 

PDTQDFQPCM GPGCSDEDGS TCTMSEHITW SPCSISCGMG MRSRERYVKQ FPEDGSVCTL 540 

PTEBIEKCTV NEECSPSSCL MTEWGEWDEC SATCGMGMKK RHRMIKMNPA DGSMCKAETS 600 

QAEKCMMPEC HTIPCLLSPW SEWSDCSVTC GKGMRTRQRM LKSLAELGDC NEDLEQVEKC 660 

MLPECPIDCB LTBWSQWSEC NKSCGKGHVI RTRMIQMEPQ PGGAPCPBTV QRKKCEIRKC 720 

LRMPSIQKLR WREARESRRS EQLKEESEGE QFPGCRMRPW TAWSECTKLC GGGIQERYMT 780 
VKKRFKSSQF TSCKDKKEIR ACNVHPC 

Seq ID NO: 7 DNA sequence

Nucleic Acid Accession #: NM_022454

Coding sequence: 205..1449

1 11 21 31 41 51 

GCAGTGTCXC TAGGCCXK3CT GGGGGCCCTG GGTAOGCTGI AGACCAGACC GCGACa«KKC 60 

AGAACa-CGGG CGGOSGCTTC GGGCCGGGAG ACCCGOSCaG CCCTCGGGGC ATCTCAGTGC 120 

CTCATTCCCC ACCCCCTCCC CCGGGTCGGG GGAGGCGGOS CGTCCGGCGG SGGGTTGAGG 180 

GGAGCGGGGC AGGCCIGGAG CGCCATGAGC AGCCCGGATG CQGGATACXSC CAGTGACGAC 240 

CAGAGCCAGA CCCAGAGCGC GCTGCCCGOG GT6ATGGCCG GGCTGGGCCC CTGCCCCTGG 300 

GCCGAGTCGC TGAGCCCCAT CQGGGAC»TG AAGGTGAAGG GOSAGGCXSCC GGOSAACAGC 360 

3 CCGGGGCCGC GGGCOQAGCC AAQGQCGAGT CCCGTATCCG GCGGCCGATG 420 

A TGGTGTGGGC TAAGGACGAG CGCAAGOSGC TGGOGCaGCA GAATCCAGAC 480 

3 CCGAGTTGAG CAAGATGCTG GGCAAGTOGT GGAAGGCGCT GACGCTGGCG 540 

GAGAAGCGGC CCTTCGTGGA GGAGGCAGAG CGGCTGCGCG TGCAGCACAT GCAGGACCAC 600 

CCCAACTACA AGTACCGGCC GCGGCGGCGC AAGCAGGTGA AGCGGCTGAA GCGGGTGGAG 660 

GGCGGCTTCC TGCACGGCCT GGCTGAGCCG CAGGCGGCCG CGCTGGGCCC CGAGGGCGGC 720 

CGCGTGGCCA TGGACGGCCT GGGCCTCCAG TT<XCCGAGC AGGGCTTCCC CGCCGGCCCG 780 
CCGCTGCTGC CTCCGCACAT GGGCGGCCAC TACCX3CGACT GCCAGAGTCT GGGCGCGCCT 840

CCGCTCGACG GCTACCCGTT GCCCACGCCC GACACGTCCC CGCTGGACGG CGTGGACCCC 900 

GACCCGGCTT TCTTCGCCGC CCCGATGCCC GQGGACTGCC OGGCGGCCGG CACCTACAGC 960 

TACGCGCAGG TCTCGGACTA CGCTGGCCCC CCGGAGOCTC CCGCCGGTCC CATGCACCCC 1020 

CGACTCGGCC CAGAGCCCGC GGGTCCCTCG ATTCCGGGCC TCCTGGCGCC ACCCAGCGCC 1080 

CTTCACGTGT ACTACGGCGC GATGGGCTCG CCCGGGGCGG GCGGOGGGCG CGGCTTCCAG 1140 

ATGCAGCCGC AACACCAGCA CCAGCACCAG CACCAGCACC ACCCCCCGGG CCCCGGACAG 1200

CCGTCGCCCC CTCCGGAGGC ACTGCCCTGC CGGGACGGCA CGGACCCCAG TCAGCCCGCC 1260

GAGCTCCTCG GGGAGGTGGA CCGCACGGAA TTTGAACAGT ATCTGCACTT CGTGTGCAAG 1320 

CCTGAGATGG GCCTCCCCTA CCAGGGGCAT GACTCCGGTG IGAATCTCCX: CGACAGCCAC 1380



G TTACACACTT 1500

CCTGGAGGAG CT; 

TTAAAAGGTG TGTTGGCATA TAATTTATGG TAATTTATTT TGTCTGCCRC T 

TGGGGGGGTG AGGTTTCATT TAAAATTTGT TCAGAGATTT GTTTCCCACA GTTGGATTGT 1680 

CaAAACCCTA TTTCCAAGTT CAAGTTAACT AGCTTTGAAT GTGTCCCAAA ACAGCTTCCT 1740 

CCATTTCCTG AAAGTTTATT GATCAAAGAA ATGTTGTCCT GGGTGTGTTT TTTCAATCTT 1800 
CTAAAAAATA AAATCTGGAA TCCTGAAAAA AAAAAAAAAA A 



1 11 21 31 41 51 

I I I I I I

MSSPDAGYAS DDQSQTQSAIi PAVMAGLQPC PWAESLSPIG DMKTKGBAPA NSGAPAGAAG 
RAKGESRIRR PMNAFMWWAK DERKRLAQQN PDLHMAELSK MLGKSWKAI.T lABKRPFVBE 
AERLRVQHMQ DHPHYKYRPR RRKQVKRLKR VEGGFLHGLA EPQAAALGPE GGRVAMDGI/3 
LQFPBQGFPA GPPLLPPHMG GHYEDCQSLG APPLDGYPLP TPDTSPLDGV DPDPAFFAAP 
MPGDCPAAGT YSYAQVSDYA GPPEPPAGPM HPRLGPEPAG PSIPGLLAPP SALHVYYGAM 
Coding sequence: 112..2856

1 11 21 31 41 51 

I I I I I I 

ARGCTTGCGG C0GC0GGC6A GGAATGGCGG TATTTGTGAG MGAGTCQGC GTTTGAAGRG 
GTGGRRCTOC TAGGGCTTTT TTGAGAGTGC TGATTTAGAA GAATACAAAT OlTGGCrGftR 
ARTAGTCTAT TAACATCCAC TACTGGGflGG ACTAGCTTGG CAGACTCTTC CRTTTTTGAT 
TCTAAAGTTA CTGAGATTTC CAAGGRAAAC TTACTTATTG GATCTACTTC ATATGTAGAA 
GAAGAGATGC CTCAGAITGA AACAAGA6IG AIATTGGTTC AAGAAGCIGG AAAACAAGAA 
GAACTTATAA AAGCCTTAAA GACIATTAAA ATAATGGAAG TOXTGTTAT AAAGATAAAA 
GAAAGTTGTC CTGGAAAATC GGATGARAAA TTAATAAAAA GTGTTATTAA TATGGACATT 
AAAGTGGGCI CTGTAAAGAT GGAGTCAGI6 6AAGAATTTG AAGGITTGGA TTCTYCGGAA 
TTKGAAAATG TATTTKKAGK CACGGACTIT CAGGATTCTG TCTTTAATGA CCTCTACAAG 
GCTGATTGTA GAGTTATTGG ACCftCCAGTIT GTATTAAATT GTTCACAASA AGGAGAGCCT 
TTGCCfiTTTT CATGTCGCCC GTTGTATTGT ACAAGTATGA TGARTCTAGT ACTATGCTTT 
ACTGGATTTA GGAAAAAAGA AGAACTAGTC AGGTTGGTGA CATTGGTCCA T 
A GTTACACATT TGGTGGCAAA T 
T TCAGGGTTGC TGTGAGTCTA GGTACTCCAA TTATGAAGCC AGAATGGATT 
TATAAAGCTT GGGAAAGGCG GAATGAACAG GATTTCTATG CAGCAGTTGA TGACTTTAGA 
r TGTATTTTAA GTTTCCTGGG ATTTTCAGAT 



S CATTCAAATG 
k TATGAAAAGG CAAATACTCC TGAGCTCAAG 
AAATCAGTGT CAATGCTTTC TCTAAATACC CCTAACAGCA ATCGCAAACG ACGTCGTTTA 
AAAGAAACAC TTGCTCAGCT TTCRAGAGftG ACAGACGTGT CACCATTTCC ACCCCX3TAAG 
CGCCCATCAG CTGAGCATTC CCTTTCCSCTA GGGTCSCTCC TAGATATCTC CAACACACCA 
GAGTCTAGCA TTAACTATGG AGAC»CCCCA AAGTCTTGTA CTAAGTCTTC T 
ACTCCAGTTC CTTCAAAGCA GTC3«3CAAGG TGGCAAGTTG CAAAAGf 

r ATTGGCAACA ATTATTCAGT TATTTCAAGT A 

3 ACCTATCCTT GCACCAGAGG AGATTAAGAC TATTTTTGGI 1620 

AGCATCCCAG ATATCTTTGA TGTACACACT AAGATAAAGG ATGATCTTGA AGACCTTATA 1680 

GTTAATTGGG ATGAGAGCAA AAGCATTGGT GACATTTTTC TGAAATATTC AAAAGATTTG 1740

GTAAAAACCT ACCCTCCCTT TGTAAACTTC TTTGAAATGA GCAAGGAAAC AATTATTAAA 1800 

TQTGAAAAAC AGAAACCAAG ATTTCATGCT TTTCTCAAGA TAAACCAAGC AAAACCAGAA 1860 

TGTGGACGGC AGAGCCTTGT TGAACTTCTT ATCOGACCAG TACAGAGGTT ACCCAGTGTT 1920 

GCATTACTTT TAAATGATCT TAAGAAGCAT ACAGCTGATG AAAATCCAGA CAAAAGCACT 1980

TTAGAAAAAG CIATTGGATC ACTGAAGGAA GTAATGACGC ATATTAATGA GGATAAGAGA 2040 

AAAACAGAAG CICSAAAGCa AATTTTIGAT GTTGTTTATG AAGTAfiATGG ATGCCCAGCT 2100 

AATCTTTTAT CTTCTCACCG AAGCTTAGTA CAGCGGGTTG AAACAATTTC TCTAGGTGflG 2160 

CACCCCTGTG ACaiGAGGAGA ACAAGTAACT CTCTTCCTCT TCAATGATTG CCTAGAGATA 2220 

GCAAGAAAAC GGCACAAG6T TATTGGCACT TTTAGGAGTC CTCATGGCCA AACCCGACCC 2280 

CCAGCTTCIC TTASGCATAT TCACCTAATG CCTCTTTCTC AGATTAAGAA GGTATTGGAC 2340 

ATAAGA6AGA CAGAAGATTG CCATAATGCT TTTGCCTTGC TTGTGAGGCC ACCAACAGAG 2400 

CAGGCAAATG TGCTACTCAG TTTCCAGATG ACATCSGATG AACTTCCSAA AGAAAACTGG 2460 

CTAAAGATGC TOTGTOGACa TGTAGCTAAC ACCATTTGTA AAGCAGATGC TGAGAATCTT 2520

ATTTATAdG CTGATCCAGA ATCCTTTGAA GTAAATACAA AAGATATGGA CAGTACATTG 2580 

AGTAGAGCaT CAAGAGCAAT AAAAAAGACT TCSiAAAAAGG TTfiCAAGAGC ATTCTCTTTC 2640 

TCCAAAACTC CAAAAAGAGC TCTTCGAAGG GCTCTTATGA CATCCCACGG CTCAGTGGftG 2700 

GGAAGAAGTC CTTCCAGCAA TGATAAGCAT GTAATGAGTC GTCTTTCTAG CACATCATCA 2760 

TTAGCAGGTA TCCCTTCTCC CTCCCTTGTC AGCCITCCTT CCTTCTTTGA AAGGAGAAGT 2820 

CATACGTTAA GTAGATCTAC AACTCATTTG ATATGAAGCG TTACCAAAAT CTTAAATTAT 2880 

AGAAATGTAT AGACACCTOA TACTCAAATA AGAAACTGAC TTAAATGGTA CTTGTAATTA 2940 

GCACTTGGTG AAAGCTGGAA GGAAGATAAA TAACACTAAA CTATGCTATT TGATTTTTCT 3000 



TTTTAAAAGA TTTGCCCTCA TTAACAAGAA TAACATTTAA AGGAGATTGT TTCAAAATAT 
AAT TGAGATAAGG ACAGAAAGAT TGAGAAACAT TGTATATTTT GCAAAAACAA 
i GCTGTTTCAG AGAGAGTACG GTATATTTAT GGTAATTTTA TCCACTAGCA 
T TAGTTTGATA GTGTGTGGAA TTTTATTTTG AAGGAIAAGA CCATGGGAAA 
ATTGTGGTAA AGACTGTTTG TACCCTTCAT GAAATAATTC TGAAGTTGCC A 
CTAATCTTCT GTGAAATGCA TAGATATGCG CATGTTCAAC TTTTTAT 
TTAAATGTAA AATTGAAAAT TCATTTGCTG TTTCAAAGIG TGATATCTTT C 
TTTTTATAGT CAQTAATTCA GAATAATCAA GTTCATATGG ATAAATGCAT T 
TATTTCTTTA GGGAGTGCTA CAAATGTTTG TCACTTAAAT TTCAAGTTTC TGTTTTAATA 3960 
GTTAACTGAC TATAGATTGT TTTCTATGCC ATGTATGTGC C3VCTTCTGAG AGTACTAAAT 4020 
GACTCTTTGC TACATTTTAA AAGCAATTGT ATTAGTAAGA ACTTTGTAAA TAAATACCTA 4080 
AAACCCSAAA AAAAAAAAAA AAAAA 



I I I I I I 

MABNSVLTST TGRTSLADSS IFDSKVTEIS KENLLIGSTS YVEEEMPQIE TRVILVQEAG 
KQEELTKALK DIKVGFVKME SVEEFEGLDS PEFENVFWT DFQDSVFHDL YKSDCRVIGP 
PWLHCSQKG EPLPFSCRPL YCTSMMNLVL CFTGFRKKBE LVELVTLVHH MGGVIRKDFH 
Z TQGEKFRVAV SLGTPIMKPE WIYKAWEKRN EQDFYAAVDD FENEFKVPPF
QDCIFSFLGF SDEEKTNMEE MTEMQGGKYL PLGDERCTHL WEENIVKDL PFEPSKKLYV 
VKQEWFWGSI QMDAEAGETM YLYEKANTPE LKKSVSMLSL NTPNSNRKRR RLKBTLAOLS 
RDTDVSPFPP RKRPSAEHSL SIGSLLDISN TPESSINYGD TPKSCTKSSK SSTPVPSKQS 
ARWQVRKELY QTESNYVNIL ATIIQLFQVP LEBEGQRGGP ILAEEEIKTI EGSIPDIFDV 
HTKIKDDLED LIVNWDESKS IGDIPLKYSK DLVKTYPPFV HFFBMSKBTI IKCEKQKPRF 
HAFLKINQAK PECGRQSLVE LLIRPVQRLP SVALLLNDLK KHTADENPDK STLEKAIGSL 
KBVMTHIKBD KRKTEAQKQI FDWYEVDGC PflNLLSSHRS LVQRVETISL GEHPCDRGEQ 
VTLFLPHDCL ElftRKRHKVI GTFRSPHGQT RPPASLKHIH IiMPLSQIKKV LDIRETEDCB 
NAFALLVRPP TBQAMVLLSF QMTSDELPKE NWLKMLCRHV ANTICKADAE NLIYTSDPES 
FBVNTKDMDS TLSEASRAIK KTSKKVTRAP SFSKTPKHAL RRALMTSHGS VEGRSPSSND 
KHVMSRLSST SSLASIPSPS LVSLPSFFER RSKTLSRSTT RLI 

Seq ID NO: 11 DNA sequence

Nucleic Acid Accession #: XM_044166 

Coding sequence: 1..1576 

1 11 21 31 41 51 

CTTTTGTTTC GCJCATGCCTA GTCIAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCIT 

TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACMAGCB 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCOCGCCCC 
, CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 
CCCGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCX3A OGACGGCCCC 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGAOG CCAAGCTCTG 
3 AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 
A GATCGTGGGC AGGCAAGGCT G 
3 GCCAAGACCA ACACCTACAT CAAGACA 

Z GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG C 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC 0 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 
CATTATCACA CCAAGCCOTG ACCGCGACCC CGTGTTCGAG A 



GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 

CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020

GAACAGCCTQ GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 

GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 

CAAGCAGGAT GTGTACTACG GOSTGGCCGA GACTAGCCCC CCGCTGTGGG OSGGCCAGGA 1200

GAACGCCACG CCCACCTCCQ TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 

CAAGGCCCGC GCTGGGCCCC OGGGCGCACA CXX3CTCCCXT GCCACTTCOG CGGGACCCGA 1320

GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 

TGGQGGCGGC CTGCGGRGCC CCGGCGGOGG GCGGGATTGC ATGGTCTGCT TTGBGAGCGA 1440 

AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACTTGTTC TGCATQGAGT GTGCAGTACXS 1500 

CATCTGCGAG AGGAQSGACC CAGAGTGTCC CGTOTGCCAC ATCACAGCCA CXSCAAGCCAT 1560 

CCGAATATTC TCCTAAGCCC OGTGCCCCAT GCCTCCGGGG CCCACTCCAC TGGGCCCaCC 1620

CTGGACCTGT TTTCCaCTAR AGCCTTTTGG AAAGOGGTGA TTTGAGGGGC AAQGTGCTTA 1680 

GAGRTACTCG CTCGCJrGGGG AAGGGGGGAfi GGAGGCACTG GTGGCTGGAG GGTGCGCCAC 1740 

TTTCAGAGCC TCTGSTCACC CTGTCCrGGA AAGATTGGGA GCX3GGCCAGA CTGSAAATTT 1800 

TACTAGAGTT ACAACICTGA TACCTCRACA CACCCTTASA TCTGGAAGCa GCTAAGAGAA 1860 

ACTTTTGTTT IGCCAGAGGT GGCCACTAAG GCATTCTGAC GCCCTCTGCC CACCTCCCCC 1920

GCTGTGTGTC ACTCCaCCCC TTCTTOCGAG GAGGGGGTGG GTAAAAGGGA GAGGGAGAAT 1980 

TACCACCTGT ATCTAGAGGT GCTCTTTGCA ATCCCTAAGC CCTCTGSTCC TGACCTCCGA 2040 

CCTCCCAGCT CTGTCTTGTT CCTTGTCTTT GTCTTTCTTC CCTTCCCCCI GCCCCTGCCC 2100 

CTACCAGCCC AGCTTTGGGG ACACCSiTCCT TCTGGGGAGA AGTAGGGGGR GGAATATTTG 2160 

GATGGTCCCT CCATICCTCT TCAGGCATCT GGAGGCCCTC TCCCCCACTC CTCCAAAGAA 2220

ACATCTCAAA TTATIGATGG AATGTATCCC CATTCTCAGT GAAAATGTGA GGftGGGGACT 2280 

AATACTGGGG TAAAGGGTCA AACCCCCACC TTCATCACTA TGGGC31TTAT ATTTAfiCSGAG 2340 

TAGTTCTTGG GCTGGATTTT CTGGTTGTGG AAGTGGGGGC GCCAGAGTAG TGTGTCTGCT 2400 

ATTTAAAGGA GCAGGAAAGG GCGTGAGGCR GGAGGAGAGA CTGGTGGAGG GAAGAGCTGC 2460 

TCCTCCCATG CAGTGCCCGA CTCCCTGCAC CCCTCTCAAC CIGACCTGAA CCTITATTGA 2520

ATCCTTATTA GCTTGAATCC TTATTAGCTT GAATCCTCCA TGCAAATCAT GGAGTCTGTG 2580

TCCCACCTGA TGTGGTTGAG GAGAAGCCAG GTCTTCAAAG AGGGGTCAGC CTGGGGCAAA 2640 

GCAGGACTGG GGGGRGGTGG GCAGCAGGGC CTATTCTGAG AATCACATAT TGTTACAGGC 2700 

CTTGCACCCC CTTTGCTGCT TCCCTCCTGC TCATTTGGGG CTGCCACCAG CTCTCCACCC 2760

TCCTGGTTCC GCTGGCCGGG CCAAGAGAGG ATGGAGGGAT GGGAGTCCCA GGAGATCCTT 2820

GTAAATAGTG GGGIGGGACT GTTCTGAGTG ATCACCCGAG CRCTTAAAGC TCCAGAGTCC 2880

CATTCTTCCT GGATGGAGCA GGTGGAGGTG CAGAGGGGAT TTCCTCCTCT CCITCCTCCT 2940 

GTCGAGAATT AACACCTCTC CACAGCCTTC CCCTCCAGAA CACCAGCCAG GGAGGGGTGG 3000 

GGAAGGAGGT CACAGCCAAG AAAACTGCCC TGTGACGACT TCCCTCCTTC CCGCCTATGT 3060

GAGCCATCCT GAGATGTCTG TACAATAGAA ACCAAACCAA ATGGGCACCC TCGGTTGCCG 3120



CTCCACTCTT TACCCACAAA GGCAGAAGAC T 
CCCACCCTTA CCAATTGAGC CAAACCTAGA AACAAACACA AAACSiCGAAT AGTGAGAGAC 
AAAATAGAGG AGAGAAAGAG AGCATGAGAG GGAGCGAGAC AGGCX3ACCBA CACAGAGGAG 
AGAAAACAAA AATAGCAAAA AAAAAAAAAA AAAGCAGTTC TTTATAATTT AATATTCTAT
TTTAATAAAG GCGITTATTA CCATATAAAT GTAGCASAGA ACCTGGGOTA ATATGAA 



- FCFAMPSLW SGIMERNQQF GELGCFGGSA KDRGLLBDER AI.QLALDQLC LLGLGEPPAP 
HAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 
FMVTGRREDV ATREREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRWG 
r KEIQQQTNTY IITPSRDRDP VFEITGAPGH VEEAHEEIET H
EVNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRIi 
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 

i RSPATSAGPE LAGLPREPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 

H NLFCMECAVR ICERTDPECP VCHITATQAI RIFS 



11 21 31 41 51 

I I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CrGAAGAACT TTTACTGAOG 60

TTCAAGRTAT TCCTTGTC3\T OiTTTGTCTT CATGTCGITC TGGTAACATC CCTGGAAGAA 120 

GATACTGATA ATTCCAGTTT GTCACX»CCA CCTGATGITA CTTTAAGCTT ACTCXCTTCA 180 

AACGAAACAG AAAAAACTAA AATCACTATA GTAAAAACCT TCAATGCTTC AGGOSTCAAA 240 

CCCCAGAGAA ATATCTGCAA TTTGTCATCT ATTTGCAATG ACTC»GCATT TTTTAGAGGT 300 

GAGATOVTGT TTCAfiTATGA TAAAGAAAGC ACTGTTOCCC AGAATCAACA TATAAC!GAAT 360 

GGCACCTTAA CTGGAGTCCT GTCTOTAAGT GAATTAAAAC GCTCSGAGCr CAAC3iARACC 420 

CTGCAAACCC TAAGTGAGAC TTACTTTATA ATGTGTGCTA CAGCAGAGGC CCAAAGCACA 480 

TTAAATTGTA CATICACAAT AAAACTGAAT AATACAATGA ATGCRTGTGC TGTAATAGCT 540 

GCTTTGGAAA GAGTAAAGAT TCGACCAATG GAACACTGCT GCTGTTCTGT CRGGATACCC 600 

TGCCCTTCCT CCCCAGAAGA GTTGGAAAAG CTTCAGTGTG ACCTGCASSA TCCCATTGTC 660 

TGTCTTGCTG ACCATCCACG TGGCCCACCA TTTTCTTCCA GCCAATCCAT CCCAGTGGTG 720 

CCTCGGGCCA CTGTGCTTTC CCAGGTCCCC AAAGCTACCT CTTTTGCTGA GCCTCCAGAT 780 

TATTCRCCTG TGACCCACAA TGTTCCCTCT CCAATAGGGG AGATTCAACC CCTTTCACCC 840 

CAGCCTTCAG CTCCCATAGC TTCCAGCCCT GeCATTGACA TGCCCCCRCA GTCTGRAACG 900 

ATCTCTTCCC CTATGCCCCA AACCCATGTC TCCGGCACCC CACCTCCTGT GAAAGCCTCA 960 

TTTTCCTCTC CCACCGTGTC TGCCCCTGCG AATGTCAACA CTACCAGCGC ACCTCCTGTC 1020 

CAGACAGACA TCGTCAACAC CAGCAGTATT TCTGATCTTG AGAACCAAGT GTTGCAGATG 1080 

: TGTCCTTGGa CAGCCTGGAG CCTAACCTCG CAGGAGAAAT GATCAACCAA 1140 

C ATGCTGGCCC CTCTGGCTCA AAGATTGCTG 1200 

3 ATGACATTGG CCTACAGCTG AACTTTTCAA ACACGACTAT AAGTCTAACC 1260 

TCCCCTTCTT TGGCTCTGGC TGTGATCAGA GTGAATGCCA GTAGTTTCAA CACAACTACC 1320 

AGTATTGGCA CAATTACTCT TCCTTCATCG CTGATGAATA ATTTACCAGC TCATGACATG 1440 

GAGCTAGCTT CCAGGGTTCA GTTOAATTTT TTTGAAACAC CTGCTTTGTT TCAGGATCCT 1500 

TCCCTGGAGA AC'CTCTCICT GATCAGCTAC GTCATATCAT CGAGTGTTGC AAACCTGACC 1560 

GTCAGGAACT TGACAAGAAA CGTGACAGTC ACATTAAAGC ACATCAACCC GAGCCRGGAT 1620

GAGTTAACAG TGAGATGTGT ATTTTGGGAC TTGGGCAGAA ATGGTGGCAG AGGAGGCTGG 1680 

TCAGACAATG GCTGCTCTGT CAAAGACAGG AGATTGAATG AAACCATCTG TACCTGTAGC 1740 

CaTCTAACAA GCTTCX3GCOT TCTGCTGGAC CTATCTAGGA CATCTGTGCT GCCTGCTCAA 1800 

ATGATQGCTC TGRCGTTCAT TACATATATT GGTTGTGGGC TTTCATCAAT TTTTCTGTCA 1860 

GTGACTCTTG TARCCTACAT AGCTTTTGAA AAGATCOGGA GGGATTACCC TTCCAAAATC 1920 

CTCATCCAGC TGTGTGCTGC TCTGCTTCTG CTGAACCTGG TCTTCCTCCT GGACTCX3TGG 1980 

ATTGCTCTGT ATAAGATGCA AGGCCTCTGC ATCTCMTGG CTGTATTTCT TCATTATTTT 2040 

CrCTTGGTCr CATTCAOITG GATGGGCCIA GRAGCS^TTCC ATATGTACCT GGCCCITGTC 2100 

AflAGTATTTA ATSCTTACAT CCX3AAAATAC ATCXITTAAAT TCTGCATTGT CGGTTGGGGG 2160 

GTACCAGCTG TGQTTGTGAC CATCATCOTG ACTATATCCC CAGATAACTA TGGGCTTGGA 2220 

TCCTATGGGA AATTCCCCAA TGGTTCACCG GRTGACTTCT GCTGGATCAA CAACBATGC» 2280 

GTATTCTACA TTSOjGTGGT GGGATATTTC TGTGTGATAT TTTTGCTGAA CGTCAGCATG 2340 

TTCATTGTGQ TCCTC6TTCA GCTCTGTCGA ATTAAAAAGA AGAAGCAACT GGGAGCCCAG 2400 

CGAAAAACCA GTATTCAAGA CCTCAGGAGT ATCGCTGGCC TTACATTTTT ACTGGBAATA 2460 

ACTTGGGGCT TTGOCTTCTT TGCCTGGGGA CCAGTTAACX3 TGACCTTCAT GTATCTGTTT 2520 

GCCATCTTTA ATACCTTACA AGGATTTTTC ATATTCATCT TTTACTGTGT GGCCSWAGAA 2580 

AATGTCAGGA ASCAATGGAG GCGGTATCTT TGTTGTGGAA AGTTAOXSCr GGCTGAAAAX 2640 

TCTGACTGGA GTAAAACTGC TACTAATGGT TTAAAGAAGC AGACTGTAAA CCAAGGAGTG 2700 

TCCAGCTCTT CAAATTCCTT ACAGTCAAGC AGTAACTCCA CTAACTCCAC CACACTGCTA 2760 

GTGAATAATG ATTGCTCAGT ACACGCAAGC GGGAATGGAA ATGCTTCTAC AGAGfiGGAAT 2820 

TGCCTTC AOSATTTCAC TGGRAAAC3«3 2880 



I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP P 
NETEKTKITI VKTFNASGVK PQRNICNLSS ICNDSAFFRG SIMFQYDKES TVPQNQHITN 
GTLTGVLSLS ELKRSELNKT LQTLSETYFI MCATAEAQST LNCTFTIKIiN NTMNACAVIA 
AIjERVKIRPM EKCCCSVRIP CPSSPEELEK LQCDLQDPIV CLADHPRGPP FSSSQSIPW 
PRRTVLSQVP KATSFAEPPD YSPVTHNVPS PIGEIQPLSP QESAPIASSP AIDMPPQSET 
ISSPMPQTHV SGTPPPVKAS FSSPTVSAPA NVNTTSAPPV QTDIVNTSSI SDLENQVLQM 
EKALSLGSLE ENLAGEMINQ VSRLLHSPPD MLAPLAQRLL KWDDIGLQL NFSKTTISLT 
SPSLALAVIR VNASSFNTTT FVAQDPANLQ VSLETQAPEN SIGTITLPSS LMNNLPAHDM 
ELASRVQFNF FETPALFQDP SLEHbSLISY VISSSVANLT VMILTRNVTV TLKHINPSQD 
ELTVRCVFWD LGRNGGRGGW SDNGCSVKDR RLNETICTCS HLTSFGVLLD LSRTSVLFAQ 
MMALTFITYI GCGLSSIFLS VTLVTYIAFE KIRRDYPSKI LIQLCAALLL LNLVFLLDSW 
IAI,YKMQGLC ISVAVFLHYF LLVSFTWMGL EAFHMYLALV KVFNTYIRKY ILKFCIVGWG 
VPAVWTIIL TISPDNYGLG SYGKFENGSP DDFCWINNNA VFYITWGYF CVIFLMVSM 
FIWLVQLCai IKKKKQIGAQ RKTSIQDLRS lAGLTFLLGI TWGFAFFAWG PVNVTFMYLF 
AIFNTLQGFF IFIFYCVAKE NVEKQWEEYL CCGKLRIiAEN SDWSKTATNG LKKQTVNQGV 
SSSSHSLQSS SHSTNSTTIiL VNMDCSVHAS GHGNASTEEN GVSFSVQHGD VCLHDFTGKQ 
HMFNEKEDSC NGKGRMAUIH TSKRGSLHFI EQM 

Seq ID NO: 15 DNA sequence
1 11 21 31 41 51

I I I I I I 

AIC3GTTTTCT CTGICAGGCA GTGTGGCOIT GTTGGCAGAA CTGAAGARGT TTTACTGACG 
TTCAAGftTAT TCCTTGTCAT CATTTGTCTT CftTGTCXSTTC TGGTAACATC CCTGGAAGAA 
GftTACTGATA ATTCCAGTTT GTCRCCACCa CKTGATGTTA CTTTAAGCTT ACTCCCTTCA 
AACGAAACAG AAAAAACIAA AATCRCTATA GTAAAAACCI TCRATGCTTC ftOGOSTCASA 
CCCCAGAGAA ATATOTGCAA TTTGTCATOT ATTTGCAATG ACTCAGCATT TTTTAGAGGT 
GAGATCATGT TTCAATATGA TAAAGAAAGC ACTGTTCCCC AGAATCBACA TATAACGAAT 
GGCACCTTAA CTGGAGTCCT GTCTCTAAGT GAATTAAAOl CATTAAATTG TACATTCACA 
ATAAAACTGA ATAATACAAT GAATGCATGT GCTGTAATAG CTGCTTTGGA AAGAGTAAAG 
ATTOGACCAA TGGAACACTG CTGCTGTTCT GTCAGGATAC CCTGCCCTTC CTCCX:CAGAA 
GAGTTGGAAA A6CTTCAGTG TGACCTGCAG GATCCCATTG TOIGTCTTGC TGACXXTCCA 
CGTGGCCCAC CATTTTCTTC CAGCCAATCC ATCCCSiGTGG TGCCTCGGGC CACTGTGCIT 
TCCCAGGTCC CC3UUVGCTAC CTCTTTTGCT GAGCCTCCAG ATTATTCACC TGTGACCC3«: 
AATGTTCCCT CTCCAATAGG GGAGATTCAA CCCCTTTCAC CCCSfiCCTTC AGCTCCCATA 
GCTTCCAGCC CTGCCATTGA CATGCCCCKA CRGTCIGAAA CGATCTCTTC CCCTATGCXX: 
CAAACCCATG TCTCCGGCAC CCCACCTCCT CTGAAAGCCI CATTTTCCTC TCCCACCGIG 
G CGAATGTCAA CACTACCAGC GCACCTCCTG TCCAGACAGA CATCGTCAAC 



1140 

GCTGTGATCA GAGTGARTGC CAGTAGTTTC AACACAACTA CCTTTGTGGC CCAAGACCCT 1260

GCAAATCTTC AGGTTTCTCT GGAAACCCAA GeTCCTGAGA ACAGTATTGG CACAATTACT 1320

CTTCCITCAT CGCTGATGAA TAATTTACCA GCTCATGACA TGGAGCTAGC TTCCAGGGTT 1380 
CAGTTCAATT TTTTTGAAAC ACCTGCTTTG TTTCAGGATC CTTCCCl 

r ACGTCATATC ATCGAGTGTT GCAAACCTGA CCGTCAGGAA C 

A GCACATCAAC CCGAGCCAGG ATGAGTTAAC AGTGAGATGT 1560 

GTATITTGGG ACITGGGCAG AAATGGTGGC AGAGGAGGCT GGTCAGACAA TGGCTGCTCT 1620

GTCAAAGACA GGAGATTGAA TGAAACCATC TGTACCTGTA GCCATCTAAC AAGCTTCGGC 1680 

GTTCTGCTGG ACCTATCTAG GACATCTGTG CTGCCTGCTC AAATGATGGC TCTGACGTTC 1740

ATTACATATA TTGGTTGTGG GCTTTCATCA ATTTTTCTGT CAGTGACTCT TGTAACCTAC 1800

ATAGCTTTTG AAAAGATCCG GAGGGATTAC CCTTCCAAAA TCCTCATCCA GCTGTGTGCT 1860 

GCTCTGCTTC TGCTGAACCT GGTCTTCCTC CTGGACTCGT GGATTGCTCT GTATAAGRTG 1920 

CAAGGCCTCT GCATCTCAGT GGCTGTATTT CTTCATTATT TTCTCTTGGT CTCATTCACA 1980 

TGGAIGGGCC TAGAAGCATT CCATATGTAC CTGGCCOTG TCAAAGTATT TAATACTTAC 2040 

ATCCGAAAAT ACATCCTTAA ATTCTGCATT GTCGGTTGGG GGGTACCAGC TGTGGTTGTG 2100 

ACCATCATCC TGSCTATATC CCCAGATAAC TATGGGCTTG GATCCTATGG GAAATTCCCC 2160 

AATC3GTTCAC OGGATGACTT CTGCTGGATC AACAACAATG CaOTATTCTA CATTAOGGTG 2220 
GTGGGATATT TCTGTGTGAT ATTTTTGCTG AACGTCAfiCA TGrTCAl 
CAGCICTCIC GAATTAAAAA GAAGAAGCAA CTGGGAGCCC AGCGAAAAAC C 

GACCTCftfiGA GTATCGCTGG CCTTAC»TTT TTACTGGGAA TAACTTGGGG CTTTGCCTTC 2400 

TTTGCCTGGG GftCXAGTTAA CXSTGACCTTC ATGTATCTGT TTGCCATCTT TAATACCITA 2460 

CAAGGATTTT TCATATTCAT CTTTTACTGT GTGGCCAAAG AAAATGTCAG GAAGCAATGG 2520 

AGGOXSTATC TTTGTOXSTGG AAAGTTACGG CTGGCTGAAA ATTCTGACTG GAGTAAAACT 2580 

GCTACTAATG GTTTAAAGAA GCAGACTGTA AACCAAGGAG TGTCCaGCTC TTCRAATTCC 2640 

TTACfiGTCAA GCAGTAACTC CACTAACTCC ACCaCACTGC TAGTGAATAA TGATTGCTCA 2700 

GTAC3«S3CAA GOGGGAATGG AAATGCTTCT ACAGSfiAGGA ATGGGGTCTC TTTTAGTGTT 2760 

CAGAATGGSG ATGTGTGCCT TCACGATTTC ACTGGAAAAC AGCACaTGTT TAACGAfiRAG 2820 

GAAGRTTCCT GCAATGGGAR AGGCCGTATG GCTCTCACSAA GGACTICAAA GCGGGGAAGC 2880 
TTACACTTTA TTGAGCAAAT GTGA 

Seq ID NO: 16 Protein sequence

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HVVLVTSLEE DTDNSSLSPP PDVTLSLLPS 60

GTLIGVLSLS ELNTLNCTFT IKLNNTMNAC AVIAALERVK IP.PMEHl 
ELEKLQCDLQ DPIVCLADHP RGPPFSSSQS IPWPRATVL SQVPKATSFA E 
NVPSPIGEIQ PLSPQPSAPI ASSPAIDMPP QSETISSPMP QTHVSGTPPP VKfiSFSSPTV 
SAPANVNTTS APPVQTDIVN TSSISDLENQ VLQMEKALSL GSLEPNLAGE HINQVSRLLH 
SPPDMLAPLA QRLLKWDDI GLQLNFSNTT ISLTSPSLAL AVIRVNASSF NTTTFVAQDP 
ANLQVSLETQ APENSIGTIT LPSSLMNNLP AHDMEIiASRV QFHFFETPAL FQDPSLENLS 
LISYVISSSV ANLTVRNLTR NVTVTLKHIN PSQDELTVEC VFWDLG: 

I CTCSHLTSFG VLLDLSRTSV LPAQMMALTF ITYIGCGLSS I 
KILIQLCA ALLLLNLVFL LDSWIALYKM QGLCISVAVF L 
i LALVKVFNTY IRKYILKFCI VGWGVPAWV TIILTISPDN Y 
NGSPDDFCWI NMNAVFYITV VGYFCVIFLL NVSMFIWLV QLCRIKKKKQ LGAQRKTSIQ 
DLRSIAGLTF LLGITWGFAF FAWGPVNVTF MYLFAIFNTL QGFFIFIFYC VAKENVRKQVI 
RRYLCCGKLR LAENSDWSKT ATNGLKKQTV NQGVSSSSNS LQSSSNSTNS TTLLVHNDCS 



Seq ID NO: 17 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..2811

1 11 21 31 41 51

I I I I I I 

ATGGTTTTCT CTGICAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAGAAGT TTTACTGAOS 60 
TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTTC TGGTAACATC CCTGGAAGAA 120 
GATACTGATA ATTCCAGTTT GTCACCACCA CCTGAGGTTG AAACAACAAG CCTCAATGAT 180 
GTTACTTTAA GCTTACTCCC TTCAAACGAA ACAGGCGTCA AACCCCAGAS AAATATCTGC 240






AATTTGTCAT CTATTTGCAR TGACTCaGCA TTTTTTAGAG GTGAGATCAT GTTTCAATAT 300 

GATAAAGRAA GCACTGTTCC CCAGAATCAA CATATAAOSA ATGGCACCTT AACTGOAGTC 360 

CTGTCrCIAA GTGAATTAAA ACXSCTCAGAG CTCAACAAAA CXICTGCAAAC CCTAAGTGAG 420 

ACTTACTTTA TAATGTCTGC TACAGCSGAG GCCCAAAGCA CATTAAATTG TACATTCACa^ 480 

ATAAAACIGA ATAATACAAT GAATGCATGT GCTGTAATAG CIGCTTTGaA AAGAGTARAG 540 

ATTCGACCAA TG6AACACTG CTGCTGTTCT GTC3«a3ATAC CXTTGOXTTC CTCCCCAGAA 600 

GAGTTGGAAA AGCTTCAGTG TGACCTGCAG GATCCCATTG TCTGTCTTGC TGACCATCCA 660 

CGTGGCCCAC CATTTTCTTC C»GCCftATCC ATCCCAGTGQ TGCCTCGGGC Ca«JrGTGCTT 720 

TCCCflGGTCC CCAAAGCTAC CTCTTTTGCT GAGCCTCCAG ATTATTCACE TGIGACCCAC 780 

AATGTTCCCT CTCCAATAGG GGAGATTCAA CCCTTTTCRC CCCAQCCTTC AGCTCCCATA 840 

GCTTCCAGCC CTGCX»TTGA CaTGCCCCC» CMTCTGAAA CGATCTCTTC CCCTATGCCC 900 

CRRACCCATG TCTCXXSGCaC CX:CACCTCCT GTGAAAGCET CATTTTCCTC TCCCSCCGTG 960 

TCTGCCCCTG CGAATGTCAA CACTACCAGC GCaCCTCCTG TCCAGACAGA CAICGTCSUIC 1020 

ACCAGCAGTA TTTCTGATCT TGAGAACCAA GTGTTGCAGA TGGAGAAGGC TCTGTCCTTG 1080 
GGCAGCCTGG AGCCTAACCT CGCAGGAGAA ATGATCAACC AAGTCAGCAG A 



C AGTGAGATGT 

C AAGCTTCGGC 
C TCrrGACGTTC 



C TGAACTTTTC AAACACGACT A 
GCTGTGATCA GAGTGAATGC CAGTAGTTTC AACACAACTA CCTTTGTG3C C 
GCAAATCTTC AGGTTTCTCT GGAAACCCAA GCTCCTGAGA ACAGTATTGG CACAATTACT 
CTTCCTTCAT CGCTGATGAA TAATTTACCA GCTCATGACA T

CAGTTCAATT TTTTTGAAAC ACCTGCTTTG TTTCAGGATC CTTCCCTGGA G 
CTGATCAGCT ACGTCATATC ATCGAGT 
AACGTGACAG TCACATTAAA GCACATCAAC C 
GTATTTTGGG ACTTGGGCAG AAATGGTGGC AGAGGAGGCT G 
GTCAAAGACA GGAGATTGAA TGAAACCATC TGTACCTGTA GCCATCTAAC A

GTTCTGCTGG ACCIATCTAG GACATCl 

ATTACaTATA TTGGTTGTGG GCTTTCATCA ATTTTTCTGT CAGTGACTCT T 
ATAGCTTTTG AAAAGATCCG GAGGGATTAC CCTTCCAAAA T 

GCTCTGCTTC TGCTGAACCT GGTCTTCCTC CTGGACTOGT G
CAAGGCCTCT GCATCTCAGT GGCTGTATTT CTTCATTATT TTCTCTTGGT CTCATTCACA
TGGATGGGCC TAGAAGCATT CCATATGTAC CTGGCCCTTG TCRAAGTA 
ATCCGAAAAT ACATCCTTAA ATTCTGCATT GTCGGTTGGG G 

ACCATCATCC TGACTATATC CCCAGATAAC TATGGGCTTG GATCXTATGG GAAATTCCCC 
AATGGTTCAC CGGATGACTT CTGCTGGATC ARCAACAATG CAGTATTCTA CATTACGGTG 
GTGGGRTATT TCTGTGTGAT ATTTTTGCTG AACGTC3«KA TGTTCATTGT GGTCCTGGTT
CAGCTCTGTC GAATTAAAAA GAAGAAGCAA CTGGGAGCTC Al 
GaCCTCAOGA GTATCGCTGG CCTTACATTT TTACTGGGAA TAACTTGGGG C 
TTTGCCTGGG GACCAfiTTAA CGTGACCTTC ATGTATCTCT TTGCCATCIT T 
CAAGGATTTT TOiTATTCAT CTTTTACTGT GTGGCCAAAG AAAATGTC31G GAAGCAATGG 2580 
AC3G0GGTATC TTTGWTGTGG AAAGTTACGG CTGGCTC3RAA ATTCTGGAAA TGCTTCTACA 2640
GASAGC3AATG GGSrCTCTTT TAGTGTTCAG AATGGAGATG TGTGCCTTCA CGATTTCaCT 2700 
S CCGTATGGCT 2760 



I I I I I I 

MVFSVEQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTDNSSLSPP PEVETTSLND 
VTLSLLPSNB TGVKPQRNIC NLSSICNDSA FFRGEIMFQY DKESTVPQNQ HITOGTLTGV 
LSLSELKRSB LNKTLQTLSE TYFIMCATAE AQSTLNCTFT IKLMNTMNAC AVIAALBRVK 
IRPMEHCCCS VRIPCPSSPE ELEKLQCDLQ DPIVCLADHP RGPPFSSSQS IPWPHATVL 
SQVPKATSFA EPPDYSPVTH NVPSPIGEIQ PLSPQPSAPl ASSPAIDMPP QSETISSPMP 
QTHVSGTPPP VKASFSSPTV SAPANVNTTS APPVQTDIVN TSSISDLENQ VLQMEKALSL 
GSLEPNLAGE MINQVSRLLH SPPDMLAPLA QRLLKWDDI GLQLNFSNTT ISLTSPSLAL 
AVIRVNASSF NTTTFVAQDP AHLQVSLETQ APEMSIGTIT LPSSLMNNLP AHDMELASRV 
QFNFFETPAL FQDPSLENLS LISWISSSV ANLTVRNLTR NVTVTLKHIN PSQDELTVRC 
VFMDLGRNGG RGGWSDNGCS VKDRRLNETI CTCSHLTSFG VLLDLSRTSV LPAQMMALTF 
ITYIGCGLSS IFLSVTLVTY lAFEKIRRDY PSKILIQLCA A 
QGLCISVAVF LHYFLLVSFT WMGLEAFHMY LALVKVFNTY I 
TIILTISPDN YGLGSYGKFP NGSPDDFCWI NKNAVFYITV VGYFCVIFLL NVSMFIWLV 
QLCRIKKKKQ LGAQRKTSIQ DLRSIAGLTF LLGIIWGFAF FAWGPVNVTF MYLFAIFNTL 
QGFFIFIFYC VAKEKVRKQW RRYLCCGKLR LAENSGHAST ERNGVSFSVQ NGDVCLHDFT 
GKQHMFNEKE DSCNGKGRMA LRRTSKRGSL HFIEQM 

Seq ID NO: 19 DNA sequence 



I I I I I I 

ATGGTTTTCT CTGTCAGGCA GTGTGGCCAT GTTGGCAGAA CTGAAC3U1GT TTTACTGACG 
TTCAAGATAT TCCTTGTCAT CATTTGTCTT CATGTCGTIC TGGTAACATC CCTGGAAGAA 
GATACTGATA ATTCCAGTTT GTCACCACCA CCTGCTAAAT TATCTGTTGT CSfiTTTTGCC 
CCCICCTCCA ATGAGGTTGA AACAACAAGC CTC3UITGATG TTACTTTAAG CTTACTCCXTT 
TCAAACGAAA CAGAAAAAAC TAAAATCACT ATAGTAAATU^ CCTTCAATGC TTCAGGCGTIC 
AAACrCCAGA GAAATATCTG CARTTTGTCA TCTATTTGCA ATGACTCAGC ATTTTTTAGA 
GGIGSlGATCA IGTTTCAATA TGATAAAGAA AGCACTGITC CXXafiAATC» ACATATAACG 
AATGGCACCT TAACTGGAGT CCTGTCTCTA AGTGAATTAA AACGCTCafiA GCTCAACAAA 
ACrCTGCAAA CCCTAAGTGA GRCTTACTTT ATAATGrSTG CIACAGCAGA GGCCCAAAGC 
ACATTAAATT GTACATTCAC AATAAAACTG AATAATACAA TGAATGCATG TGCTGCAATA 
GCCGCTTTG9 AAAGRGTAAA GATTCGACCa ATGGAACfiCT GCTGCTGTTC TCTCSGGATA 
COCTGCCCTT CCTCCCCAGA AGAGTTGGGR AAGCTTCAGT STGACCTGCa GGATCCCaTT 
GTCTGTCTTG CTGACCATCC ACGTGGCCCA CCATTTTCTT CCAGCCAATC CATCCCAGTG 
G CCACTGTGCT TTCCCAGGTC CCCAAAGCTA CCTCTTTTGC TGAGCCTCCA 



GATTATTCAC CTGIGACCCA CAATGTTCCC TCTCCAATAG GGGAGATTCA ACCCCTTTCA 
CCCCflGCCTT CAGCTCCCAT AGCTTCCaGC CCTGCCATTG ACATGCCCCC ACAGTCTGAA
ACGATCTCTT CCCCTATGCC CCAAACCCAT GTCTCCGGCA CCCCACCTCC TG7GAAAGCC 
TCATTTTCCT CTCCCACCGT GTCTGCCCCT GCGAATGTCA ACACTACCAG CGCACC7CCT 
GTCCAGACAG ACATCGTCAA CACCAGCAGT ATTTCTGATC TTGAGA-^lCCA AG7GTTGCAG 
T GGGCAGCCTG GAGCCTAACC " 
A TTCCCCGCCT G 

CTGAAAGTAG TGGATGACAT TGGCCTACAG CTGAACTTTT CAAACACGAC TA7AAG7CTA 1320 

ACCTCCCCTT CTTTGGCTCT GGCTGTGATC AGAGTGAATG CCAGTAGTTT CAACACAACT 1380 

ACCTTTGTGG CCCAAGACCC TGCAAATCTT CAGGTTTCTC TGGAAACCCA AGCTCCTGAG 1440 

AACAGTATTG GCACAATTAC TCTTCCTTCA TCXSCTGATGft ATAATTTACC AGCTCATGAC 1500 

ATGGAGCTAG CTTCTAGGGT TCAGTTCAAT TTTTTTGftAA CACCTGCTTT GTTTCAGGAT 1560 

CCTTCCCTGG AGAACCrCTC TCTC3ATC3«SC TACGTCATAT CATCGAGTGT TGCAAACCTG 1620 

ACCC3ICAGQA ACTTGACAAG AAACGTGACA GTCACATTAA AGCACATCAA CCCGAGCOW 1680 

GATGAGTTAA OWSTGRGATG TGTATTTTGG GACTTGGGCX GAAATGGTGG CAGAGGAGGC 1740 

TGGTCAGACA ATGGCTGCTC TGTCAASGAC SGGAGATTGA ATGAAACCAT CTGTACCTGT 1800 

AGCCATCTAA CAAGCTTCGG CGTTCTGCTG GACCTATCTA GGACaTCTGT GCTGCCTGCT 1860 

CAAATGATGG CTCTGftCGTT CATTACATAT ATTGGTTGTG GGCTTTCATC AATTTTTCTG 1920 

TCAGTGACIC TTGTAACCTA CATAGCTTTT GAAAAGATOC GGAGGGATTA CCCTTCCAAA 1980 

ATCCTCATCC AGCT6TGTGC TGCTCTGCTT CTGCTGAACC TGGTCTTCCT CCTGGACTCG 2040 

TGGATTGCTC TGTATAAGAT GCAAGGCCTC TGCATCTCAG TGGCTGTATT TCTTCATTAT 2100 

TTTCTCTTGG TCTCATTCAC ATGGATGGGC CTAfiAAGCaT TCCATATGTA CTIGGCCXTTT 2160 

GTCAAAGTAT TTAATACTTA CATCCGAAAA TACATCCTTA AATTCTGCAT TGTOGGTTGG 2220 

GGGGTACCAG CTGTGGTTGT GACCATCATC CTGACTATAT CCCCAfiRTAA CTATGGGCIT 2280 

GGATCCTATG GGAAATTCCC CSiATGGITCA GCGGATGACT TCTGCTGGAT CAACAACAAT 2340 

GCAGTATTCT ACATTACGGT GGTGGGATAT TTCTGTGTGA TATTTTTGCT GAACGTCAGC 2400 

ATGTTCATTG TGGTCCTGGT TCAGCTCTGT CGAATTAAAA AGAAGAAGCA ACTGGGAGCC 2460 

CAGCGAAAM CCAGTATTCA AGACCTCAGG AGTATCGCIG GCCTTACATT TTTACTGGGA 2520 

ATAACTTGGG GCTTTGCCTT CTTTGCCTGG GGACCAGTTA ACGTGACCTT CATGTATCTG 2580 

TTTGCCATCT TTAATACCTT ACAAGGATTT TTCATATTCA TCTTTTACTG TGTGGCCRAA 2640 

GAAAATGTCA GGAAGCAATC GAGGCGGTAT CTTTGTTGTG GAAAGTTACG GCTGGCTGAA 2700 

AATTCTGACT GGAGTAAA^iC TGCTACTAAT GGTTTAAAGA AGCAGACTGT AAACCAAGGA 2760

GTGTCCAGCT CTTCAAATTC CTTACAGTCA AGCAGTAACT CCACTAACTC CACCACACTG 2820

CTAGTGAATA ATGATTGCTC AGTACACGCA AGCGGGAATG GAAATGCTTC TACAGAGAGG 2880 

AATGGGGTCT CTTTTAGTGT TCAGAATGGA GATGTGTGCC TTCAOSATTT CACTGGAAAA 2940 

CAGCACATGT TTAACGAGAfl GGAAGATTCC TGCAATGGGA AAGGCEGTAT GGCTCTCfiGA 3000 
AGGACTTCAA AGCGGGGAAG CTTACACTTT ATTGAGCAAA TGTGA 

Protein Accession #: Eos sequence

1 f r r r r 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSLEE DTI 
PSSNBVBTTS LNDVTLSLLE SNETBKTKIT IVKTFNASGV KPQRNICaiLS S 
GBIMFQmKE STVPQNQHIT NGTLTGVLSL SELKRSBLNK TLQTLSETYF IMCATABAQS 
TLNCTFTIKL NHTMNACAAI AALERVKIEP MEHCCCSVRI PCPSSPEE:.G KLQCDLQDPl 
VCXADHPRGP PFSSSQSIPV VPEATVLSQV PKATSFAEPP DYSPVTHNVF SPIGEIQPLS 
PQPSAPIASS PAIDMPPQSB TISSPMPQTH VSGTPPPVKA SFSSPTVSAP AlWNTTSAPP 
VQTDIVNTSS ISDLENQVLQ MEKALSLGSL EPNIiAGEMIN QVSRLLHSPP DMLAPLAQRL 
LKWDDIGLQ LNFSNTTISL TSPSLALAVI RVNASSFNTT TFVAQDPANL QVSLETQAPE 
NSICTITLPS SLMNNLPAHD MBLASRVQFN FFBTPALEQD PSLBNLSLIS WISSSVANL 
TVBNLTRNVT VTLKHIHPSQ DELTVRCVFW DLGRNGGRGG WSDHGCSVKD RRLNETICTC 
SHLTSFGVLL DLSRTSVLPA QMMALTFITY IGOGLSSIFL SVTLVTYIAF EKIEKDYPSK 
ILIQLCAAIiL LUJLVFLUJS WIAIiYKHQGL CISVAVFLHY FLLVSFTWMG LBAFHMXLAL 
VKVEHTYIHK YILKFCIVGW GVPAVWTII LTISPDMYGL GSYGKFPHGS PDDFCWmUH 
AVFYITWGY FCVIFLUWS MFIWLVQLC RIKKKKQLGA QRKTSIQDLR SIAGLTFLLG 
ITWGFAFFAW GPVNVTFMYL FAIFNTLQGP FIFIFYCVAK BNVEKQWRRY L 
NSDWSKTATN GLKKQTVNQG VSSSSNSLQS SSNST ' 
NGVSFSVQNG DVCLHDFTGK QHMFNBKBDS CNGK6RMALR RTSKRGSLHF lEQM 

Seq ID NO: 21 DNA sequence



AGGACGCGAG CGGCAGGTGT GCACAGAGGT TCTCCACTTT G 



GTTTTACTGA CGTTCAAGAT ATTCCTTGTC ATCATTTGTC TTCATGTCGT T 
A TAATTCCAGT TTGTCACCAC CACCTGCTAA A' 
3 CCCCCTCCTC CAATGAGGTT GAAACAACAA GCCTCAATGA TGTTACTTTA 
AGCTTACTCC CTTCAAACGA AACAGAAAAA ACTAAAATCA CTATAGTAAA AACCTICAAT 
A GAGAAATATC TGCAATTTGT CATCTATTTG CAA7GACTCA 
3TTTCAA TATGATAAAG AAAGCACTGT TCCCCAGAAT 
CAACATATAA CGAATGGCAC CTTAACTGGA GTCCTGTCTC TAAGTGAATT AAAACGCTCA 
GAGCTCAACA AAACCCTGCA AACCCTAAGT GAGACTTACT TTATAATGTG TGCTACAGCA 
GAGGCCCAAA GCACATTAAA TTGTACATTC ACAATAAAAC TGAATAATAC AATGAATGCA 
TGTGCTGCAA TAGCCGCTTT GGAAAGAGTA AAGATTCGAC CAATGGAACA CTGCTGCTGT 
TCTGTCAGGA TACCCTGCCC TTCCTCCCCA GAAGAGTTGG GAAAGCTTCA GTGTGACCTG 
CAGGATCCCA TTGTCTGTCT TGCTGACCAT CCACGIGGCC CACCATTTTC TTCCAGCCAA 
TCCATCCCAG TGGTGCCTCG GGCCACTGTG CTTTCCCAGG TCCCCAAAGC TACCTCTTTT 
GCTGAGCCTC CAGATTATTC ACCTGTGACC CACAATGTTC CCTCTCCAAT AGGGGAGATT 
CAACCCCTTT CACCCCAGCC TTCAGCTCCC ATAGCITCCA GCCCTGCCAT 
CCACAGTCTG AAACGATCTC TTCCCCTATG CCCCAAACCC ATGTCTCXXSG C 
CCTGTGAAAG CCTCATITTC CTCTCCCACC GTGTCTGCCC CTGCGAATGT CAACACTACC ; 
AGCGCACCTC CTGTCCAGAC AGACATOSTC AACACCAGCA GTATTTCIGA TCTTGAGAAC 
CAAOTGTTGC AGATGGAGAA GGCTCTGTCC TTGGGCAGOC TGGAGCCTAA CCTCGCAGGA 
GAAATGATCA ACCAAGTCAG CRGACTCCTT CATTCCCOGC CTGACATGCT GGCCCCTCTG 1320

GCTCflAAGRT TGCTGRAAGI AGTGGATGAC ATTGGCCTAC AGCT6AACTT TTCftflACACG 1380 

ACTATAASTC TAACCTCCCC TTCTTTGGCT CTGGCTGTGA TCAGAGTGAA TGCCAGTAGT 1440 

TTCAACACflA CTACCITTGT GGCCCRAGAC CCTGCAAATC TTC»GGTTTC TCTGGAAACC 1500 

CAAGCTCCT6 AGAACAGTAT TGGCACAATT ACTCTTCCTT CATCGOTGAT GAATAATTTA 1560

CCAGCTCaTS ACATGGAGCT AfiCTTCCAGG GTTCAGTTCA ATTTTTTTGA AACACCTGCT 1620 

TTOITTCaGG ATCCTTCCCT GGAGAACCTC TCTCTGATCA GCPACSTCAT ATCATCGAGT 1680 

GTTGCAAACC TGACCGTCAG GAACTTGACA AGAAACGTGA CAGTCAC»TT AAAGCACSTC 1740 

AACCCGAGCC AGGATGAGTT ARCAGTGAGA TGTGTATTTT C3GGACTTGGG CAGAAATGGT 1800 

GGCAGAGGAG GCTGGTCAGA C34ATGGCTGC TCTGTCAAAG ACSGGAGATT GAATGAAACC 1860

ATCTGTACCT GTAGCCATCT AACASGCTTC GGOSTTCTGC TGGACCTATC TAGGACATCT 1920 

GTGCTGCCTG CTCAAATGAT GGCTCTGACB TTCATTACAT ATATTGGTTG TGGGCTTTCA 1980 

TCAATTTTTC TGTCAGTGAC TCTTGIAACC TACATAGCTT TTGAAAAGAT COSGAGGGAT 2040 

TACCCTTCCA AAATCCTO^T CCAGCTGTGT GCTGCTCIGC TTCTGCTGAR CCTGGTCTTC 2100 

CTCCTGGACT CGTGGATTGC TCTGTATAAG ATQCAAGGCC TCTGCATCTC AGTGGCTGTA 2160

TTTCTTCATT ATTTTCTCTT GGTCTCATTC ACATGGATGG GCCTAGAAGC ATTCCATATG 2220 

TACCTGGCCC TTGTCAAAGT ATTTAATACT TACATCCGAA AATACATCCT TAAATTCTGC 2280 

ATTGTCGGTT GGGGGGTACC AGCTGTGGTT GTGACCATCA TCCTGACTAT ATCCCCAGAT 2340 

AACTATGGGC TTGGATCCTA TGGGAfiATTC CCCflATGGTT CACCGGATGA CTTCTGCTGG 2400

ATCAACAACA ATGCAGTATT CTACATTACG GTGGTGGGAT ATTTCTGTGT GATATTTTTG 2460
CTGAACGTCA GCATGTTCAT TGTGGTCCTG GTTCAGCTCT GTCGAATTA" 
CAACTGGGAG CCCAGCGAAA AACCAGTATT CAAGACCTCA GGAGTATCG 
TTTTTACTGG GAATAACTTG GGGCTTTGCC TTCTTTGGCT GGGGACCAGT T 

TGTGTGGCCA AAGAAAATGT CAGGAAC

CGGCTGGCTG AAAATTCTGA CTGGAGTAAA A 



TCCACCACAC TGCTAGTGAA I 

TCTACAGAGA GGAATGGGGT CTCTTTTAGT GTTCAGAATG GAGATGTGTG CCITCACGAT 
TTCACTGGAA AACAGCACAT GTTTAACGAG AAGGAAGATT C 
ATGGCTCTCA GAAGGACTTC AAAGCGGGGA AGCTTACACT T 
CTTTCTTCTA AAATCAAAGC ATGATGCTTG ACAGTGTGAA ATGTCCAATT T 

CACAATGTGA GATGTATGAA AATCAACTCA TTTTATTCTC GGCAACATCT GGAGAAGCAT 3240 

AAGCTAATTA AGGGCGATGA TTATTATTAC AAGAAGAAAC CAAGACATTA CACCATGGTT 3300

TTTAGACATT TCTGATTTGG TTTCTTATCT TTCATTTTAT AAGAAGGTTG GTTT7,\AAO\ 3360

ATACACTAAG AATGACTCCT ATAAAGAAAR CAAAAAAAGG TAGTGAACTT TCAGCTACCT 3420 

TTTAAAGAGG CTAAGTTATC TTTGATAACA TCATATAAAG CAACIGTTGA CTTCAGCCTG 3480

TTGGTGAGTT TAGTTGTGCA TGCCTTTGTT GTATATAAGC TAAAITCTAG TGACCCATGT 3540

GTCAAAAATC TTACTTCTAC ATTTTTTTGT ATTTATTTTC TACTGTGTAA ATGTATTCCT 3600 

TTGTAGAATC ATGGTTGTTT TGTCTCAOST GATAATTCa«3 JiAAATCCTTG CTCX3TTCXX3C 3660 

AAATCCTAAA GCTCCTTTTG GAGATGATAT AGGATGTGAA ATACAGAAAC CTC»GTGAAA 3720 

TCAAGAAATA ATGATCCCAG CCAGACTGAG AAAATGTAAG CAGRCSGTGC C»C»GTTAGC 3780 

TCATACAGTG CCTTTGAGCA AGTTAGGAAA AGATGCCCCC ACTGGGCAGA C»CAGC3CCTA 3840 

TGGGTCATGG TTTGACAAAC AGAGTGAGAG ACCATATTTT AGCCCCACTC ACCCTCTTGG 3900 

GTGCACGACC TGTACAGCCA AACaCAGCAT CCAATATGAA TACCCATCCC CTGACCGCAT 3960 

CCCCAGTAGT CAGATTATAG AATCTGCACC AAGATGTTTA GCTTTATACC TTGGCCACAG 4020 

AGAGGGATGA ACTGrTCATCC AGACCATGTG TCAGGRAAAT TGTGAACGKA GATGAGGTAC 4080 

ATACACTGCC GCTTCTCAAA TCCCCAGAGC CTTTAGGAAC AGGAGAGTAG ACTAGGATTC 4140 

CTTCTCTTAA AAAGGTACAT ATATATGGAA AAAAATCATA TTGCCMTTCT TTAAAAGGCA 4200 

CATTGTTGA TTGTTATGAC TGGTACACTC TGGCCCAGCC AGAGCTATAA 4260 

A AATGTGTCTT GAAGAATGCA CaOTGACAAG GGOAGTAGCT ATTGGOAACa 4320 



TTATTAGGAA CATTTCAAAC CCCTTTTAGT TAAGTCTTTC ACTAAGGTTC TCTTGCATAT 4500 

ATTTCAAGTTG AAIGTTGGAT CTC34GACIAA CCATAGTAAT AATACACATT TCTGTGAGTG 4560 

CTGACTTGTC TTTGCAATAT TTCTTTTCTG ATTTATTTAA TTTTCTTGTA TTTATATGTT 4620 
AAAATCAAAA ATGTTAAAAT CSiATGAAATA AATTTGCaWSI TAAGA 

Seq ID NO: 22 Protein sequence
Protein Accession #: NP_005747.1

1 11 21 31 41 51 

I' I I I I I 

MVFSVRQCGH VGRTEEVLLT FKIFLVIICL HWLVTSL3E DTDNSSLSPP PAiCLSWSFA 60 
PSSNEVETTS IiNDVTLSLLP SNETGVKPQR NICNLSSICK DSAFFRGBIM FQTOKBSTVP 120 
QNQHITNGTL TGVLSLSELK RSELNKTLQT LSETYFIMCA TAEAQSTLNC T 
NACAVIAALE RVKIRPMEHC CCSVRIPCPS SPE3LEKLQC DLQDPIVCLA D 
, SQSIPWPRA TVLSQVPKAT SFAEPPDYSP VTHNVPSPIG EIQPLSPQPS APIASSPAID 
MPPQSETISS PMPQTHVSGT PPPVKASFSS PTVSAPANVN TTSAPPVQTD IVMTSSISDI. 
BNQVLQMEKA LSLGSLEPNL AGEHINQVSR LLHSPPDMLA PLAQRLLKW DDIGLQLNFS 
NTTISLTSPS LALAVIRVNA SSFNTTTFVA QDPANLQVSL ETQAPENSIG TITLPSSLMN 
NLPAHDMELA SRVQFNFFET PALFQDPSLE NLSblSYVIS SSVANLTVRN LTRNVTVTLK 
HINPSQDELT VRCVFWDLGR NGGRGGWSDN GCSVKDRRLN ETICTCSHLT SFGVLLDLSR 
TSVLPAQMMA LTFITYIGCG LSSIFLSVTL VTYIAFEKIR RDYPSKILIQ LCAALLLUtt 
VFLLDSWIAL YKMQGLCISV AVFLHXFLLV SFTWMGLEAF HMYLALVKVF NTYIRKYILK 
FCIVGWGVPA VWTIILTIS PDNYQLGSYG KPPNGSPDDF CMIlTONAVFr ITWGYFCVI 
FLLNVSMFIV VLVQLCRIKK KKQLGBQRKT SIQDLRSIAG LTFLLGITWG FAFFAWGPVN 
VTFMYLFAIF NTLQGFFIFI FYCVAKEMVE KQWRRYLCXXS KLRLAENSDW SKTATHGLKK 
QTVNQGVSSS SHSLQSSSNS TNSTTLLVNN DCSVHASGNG NASTBBNGVS FSVQNGDVCL 
HDFTGKQHHF NEKEDSCNGK GRMALRSTSK RGSLKFIEQH 

Seq ID NO: 23 DNA sequence

Coding sequence: 67..363



GAGACATTCC TCftATTGCTT AGACATATTC TQAGCCTACA GCAGAGGflAC CTCCRGTCTC 60 

AGCaCCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTRRGTGGC 120

ATTCAAGGRG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCRA 180 

CCTGTTAATC CAAGCTCTTT AC3AAAAACTT GAAATTATTC CTGCAAGCX^ ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTOT GAATCCAGAA 300 

TCGAAGGCCA TCAAGAATTT ACIGAAAGCA OTTAGCAfiGG AAATGTCTAA AAGATCTCCT 360 

TAAAACC3«3A GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 

CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATa«3C TGCTACTACT CCTGTAGGAA 540 

GGTTAATGTT CATC»TCCTA AGCTATTCAG TAATAACTCT ACXCTGGCAC TATAATGTAA 600 

GCTCTAOIGA GGTGCTATGT TCTTAC3TGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 

ACCTTTOCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

TCAGAATCTC AAATAACTAA AAGGTATGCA ATCBAATCTG CITTTTAAAG AATGCTCTTT 780 

ACTTCATGGA CTTCCACTGC CaTCCTCCCa AGGGGCX:CAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CSGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT C5IATATATTT CCTATATTGT 960 

TTTCAGTGTA CATGGAATAA CaTGTAATTA ASTACTAICT ATCAATGAGT AACAGGAAAA 1020 

TTTTAAAAAT ACAGATA6AT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCRAATA AAAATGAGGT ACTCTCCTGG A 

Seq ID NO: 24 Protein sequence 



MNQTAILICC LIFLTLSGIQ SVPLSRTVRC TCISISNQPV NPRSLBKLBI IPASQFCPRV 



CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 
GAGCTGAATT T 



GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC C

G3CTATGCAG ACATCCATGQ AGACTTACTA CCIATAAATA ATGATGATAA TTATCACAAA 240 

GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 

TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 

CGTCCTGACA ACCATAGAAA AAAGCCAC^T ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 

CCTGrrGTCTT CTATTATAGA CX3TGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480

TACaAATAOS GCaCGGAGAA ACCCXITAGGR TTCTACATCC GGGATGGCIC CaGTGTCAGG 540

GTAACaCCAC ATGGCTTASA AAAGGTTC3CA QGQATCITTA TATCCAGGCT TGTCCCAGGA 600 

GGTCTGGCTC AAAGTAOVGG ACTATTAfiCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 

ATAGAAGTTT CAGGGAAGAS CCTTGATCAA GTAACSGACA TGATGATTGC AAATAGCCGT 720 

AACCTCATC» TAACAGTGAG ACCQGCAAAC CSGAGGAATA ATGTTGTGAG GAACAGTCGG 780

ACTTCTGGCa GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAQC»GATT 840 

GAACCAAGCr TTGAGCCAGA GGATGAAGAC AGCGAAGAAG ATGACATTAT C»TTGAAGAC 900 

AATGGASTGC CACafiCaGAT TCCAAAAGCT STTCCTAATA CTGAGAGCCT GGAfiTCATTA 960 

ACACAGATAG AGCTAAGCTT TGAGTCTGGA CaGAATGGCT TTATTCCCTC TAATGAAGTTG 1020 



Protein Accession #: XP_0305S9

1 11 21 31 41 51 

MNRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIENVDVLV 
PINNDDNYHK AVSTANPLLR IFIQKKEEAD YSAFGTDTLI KKKNVLTNVL 
[VISMPQDFR PVSSIIDVDI LPETHERVRL YKXGTBKPLG FYIRDGSSVR 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLEVNG lEVSGKSLDQ VTDMMIAHSH 
NLIITVRPAN QRNHWRNSR TSGSSQQSTD NSLLGYPQQI BPSFBPBDBD SBBDDIIIBD 
NGVPQQIPKA VPKTESLESL TQIBLSFBSG QNGFIPSNBV SLAAIASSSN TBFBTHAPDQ 




a ACCTGGC-GCT C 
CCTTCCAACC TCAGOSTCTT CACCTCCTAC CTAGACCTCfi GTATGA; 
CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT jacaQGfA?.C 
GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTA.'i A 
CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCIC TGCAGAAT7T G 
CARTCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 
CTGCATTCCC TGAGGCACCT GTGGCTGGAT GAC3UlTGaGT TAACAGAAAT CCCCGTCCAG 
GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 
ATACCAGRCT ATGCCTTTGG AAACCTCTCC AGCITGGTAfi TTCTACATCT CCATAACAAT 



GAACTACATT TCTATGACAA TCCCATCCAA TTTGTTGGGA GATCTGCTTT TCAACATTTA 
CCTGAACTAA GAACACTGAC TCTGAATGGT GCCTCACAAA TAACTGAATT TCCTGATTTA 
G CAAACCTGGA OAGTCTGACT TTAACTGGAG CACAGATCTC ATCTCTTCCT 
CAAACCSJICT GCAATCAGTT ACCTAATCTC CAAGTGCTAG ATCTGTCTTA CAACCTATTA 1020

6AAGATTTAC CCAGTTTTTC AGTCTGCCAA AAGCTTCAGA AAATTGACCT AAGACATAAT 1080 

GAAATCTACG AAATTAAAGT TGACACTTTC CAGCAGTTGC TTAGCCTCOS ATCGCTGAAT 1140 

TTGGCTTGGA ACAAAATTGC TATTATTCAC CCCAATGCAT TTTCCACTTT GCCATCCXITA 1200 

ATAAAGCTGG ACCTATCGTC CAACCTCCTG TCGTCTTTTC CTATAACTGG GTTACATCGT 1260 

TTAACTCACT TAAAATTAAC AGGAAATCAT GCCTTACaGA GCTTGATATC ATCTGAAAAC 1320 

TTTCCAGAAC TCAflGGTTAT AGAAATGCCT TATGCTTACC AGTGCTGTGC ATTTGGAGTG 1380 

TGTGAGAATG CCTATAAGAT TTCTAATCAA TGGAATAAAG GTGACAACAG CAGTATGGAC 1440 

GACCTTCATA AGAAAQATGC TGGAATGTTT CS^QGCTCAAG ATGAACGTGA CCTTGAAGAT 1500 

TTCCTGCITG ACTTTGAGGA AGACCTGAAA GCCCTTCaTT CMTGCAGTG TTCACCTTCC 1560 

CCAGGCCCCT TCAAACCCTG TGAACACCTG CTTGATGGCT GGCTGATCAG AAITGGAGTG 1620 

TGGACCATAfi CAGTTCTQGC ACTTACTTQT AATQCTTTGG TGACTTCAAC AGTTTTCaGA 1680 

TCCCCTCTGT ACATTTCCXX: C»TTAAACTG TTAATTGGGG TCATCGCAGC AGTGAACATG 1740 

CTCACGGGAG TCTCCaGTGC CGTGCTGGCT GGTGTGGATG CX3TTC31CTTT TGGCAGCTTT 1800 

GCAC6ACATG GTGCCTGGIG GGAQAATGGG GTTGGTTGCC ATGTCATTGG TTTTTTGTCC 1860 

ATTTTTGCTT CAGAATCATC TGTTTTCCTG CTTACTCIGG CafiCCCTGGA GOSTGGGTTC 1920 

TCTGTGAAAT ATTCTGCAAA ATTTGAAACG AAAGCTCCAT TTTCTAGCCT GAAAGTAATC 1980 

ATTTTGCTCT GTGCCCTGCT GGCCTTGACC ATGGCCGCafi TTCCCCTGCT GGGTGGCaGC 2040 

AAGTATGGCG CCTCCCXTCT CTGCCTGCCI TTGCCTTTTG GGQAGCCCAG CACCATGGQC 2100 

TACATGGTCG CTCTCATCTT GCTCAATTCC CTTTGCTTCC TCATGATGAC CATTGCCTAC 2160 

T ACTGCAATTT GGACAAGGGA GACCTGGAGA ATATTTGGGA CTGCTCTATG 2220 

r GCTCTTCACC AACTGCATCC TAAACTGCCC TGTGGCTTTC 2280 

TTGTCCTTCT CCTCTTTAAT AAACCTTACA TTTATCRGTC CTGAAGTAAT TAAGTTTATC 2340 

CTTCTGGTGG TAGTCCCACT TCCTGCATGT CTCAATCCCC TTCTCTACaT CTTGTTCAAT 2400 

CCTCACTTTA AGGAGGATCT GGTGAGCCTG AGAAAGCAAA CCTACGTCTG GACAAGATCA 2460 

AAACACCCAR GCTTGATGTC AATTAACTCT GATGATGTCG AAAAACAGIC CTGTGACTCA 2520 

ACTCAAGCCT TGGTAACCTT TACa«3CTCC AQCATC»CTT ATGACCTGCC TCCCAGTTCC 2580 

GTGCCATCAC CAGCTTATCC AGTGACTGAG AGCTGCCATC TTTCETCTGT GGCATTTGTC 2640 
CCATGTCTTA A 

Seq ID NO: 28 Protein sequence

1 11 21 31 41 51

I I I I I I 

MDTSRLGVLL SLPVLLQLAT GGSSPRSGVL LRGCPIHCHC EPDGRMLLRV DCSDLGLSEL 60

PSNLSVFTSY LDLSMNNISQ LLPNPLPSLR FLEBLRLAGN ALTYIPKGAF TGLYSLKVLM 120 

LQMNQLRHVP TERLQNLRSL QSLRLDANHI SYVPPSCFSG LHSLRHLWLD DNALTEIPVQ 180 

AFRSLSALQA MTLALNKIHH IPDYAFGNLS SLWLHLHNN RIHSLGKKCF DGLHSLBTLD 240 

LNYMNLDEFP TAIRTLSNLK ELHFTONPIQ FVGRSAFQHL PELRTLTLNG ASQITEFPDL 300 

TGTANLESLT LTGAQISSLP QTVCNQLPNL QVLDLSYNLL EDLPSFSVCQ KLQKIDLRHN 360 

BIYBIKVDTF QQLLSLRSLH LAWNKIAIIH PNAFSTLPSL IKLDLSSNLL SSFPITGLHG 420 

LTHLKLTGNH ALQSLISSEN FPBLKVIEMP YAYQCCAFGV CENAYKISNQ WNKGDNSSMD 480 

DLHKKDAGMF QAQDEHDLED FLLDFEEDLK ALHSVQCSPS PGPFKPCEHL LDGWLIRIGV 540 

WTIAVLALTC NALVTSTVFR SPLYISPIKL LIGVIAAVNM LTGVSSAVLA GVDAFTFGSF 600 

SRHGAMWENG VGCHVIGFLS IFASESSVFL I/TLAALEEGF SVKYSAKFET KAPFSSLKVI 660 

ILLCALIiAIjT MAAVPIiGGS KYGASPLCLP LPFGEPSTMG YMVALILLNS LCFLMMTIAY 720 

TKLYCNLDKG DLENIVTOCSM VKHIALLLFT NCILHCPVAF LSFSSLINLT FISPEVIKFI 780 

LLWVPLPAC UJPLLYILFN PHFKBDLVSL RKQTYVWTRS KHPSLMSINS DDVEHQSCDS 840 
TQALVTFTSS SITYBLPPSS VPSPAYPVTE SCHLSSVAFV PCL 

Seq ID NO: 29 DNA sequence

Nucleic Acid Accession #: NM_002497.1

Coding sequence: 135.. 1472 



GTTCCTGGTC CCTGGAGCTC CGCACTTGGC 



CAGGCTCCTA CGGCCGCTGC 
QGAAAGAACT TGACTATGGC 
TGAATTTGCT TCGTGAACTG 



T GGTGGTCATA 



3 GGCTGCTTGC TGTATGAGTT ATGTGCA7TA AT3CCTCCAT 
r TAGCCAGAAA G 
TTCCATACCG TTACTCTGAT G 

ATTACCATCG ACCTTCTGTT GAAGARATTC TTGAGAACCC T 
CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG C 

CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 

GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140 

TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCaGA AAATCTGTTG AAGAACIACA 1200

GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260

CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTARAGAG AACATCXTGA 1320 

QGAGTGAQAA TTCIGAGAGT CAGCTCACAT CTAAGTCC3iA GTGCAAGGAC CTGAAGAAAA 1380 

GGCTTC3M:GC TGCCCAGCTG CGGGCTCAAG CCCTQTCAQA TATTQAGAAA AATTACCAAC 1440 

TGAAAAGCAG ACAGATCCTG GGCATGCGCT AGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

AGGATOTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 

GGCCCCATGA GCCATGCCTT TCTGIATAGT ACACATQATA TTTCGGAATT GGTTTTACTG 1620 

TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 

ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 

GAACATQAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 
TATGC3«3<3AA GAGTAGCACT CACTGRATAG TTITAAATGA CTGAGTGGTA TGCTTACSAT 1860

TGTCATGTCT AGATTTAAftT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGRAAAC 1920 

CCAGTTAGAT GCAATTTGGT CATTAATACC ATGRCATCTT GCTTATAAAT ATTCCaTTGC 1980 

TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 

TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 



1 11 21 31 41 51 

I I I I I I 

MPSRAEDYEV LYTIGTGSYG RCQKIRRKSD GKILVWKBLD YGSMTBABKQ MLVSBVNLLR 
ELKHPNIVRY YDRIIDRTNT TLYIVMEYCE GGDLASVITK GTKERQYLDE EFVIRVMTQL 
TLAIiKBCBRR SDGGHTVLHR DLKPRNVFLD GKQNVKLGDF GLARILNHDT SFAKIFVGTP 
YYMSPEQMNR MSYNEKSDIW SLGCLLYBLC ALMPPFTAFS QKBLAGKIRB GKFRRIPYRY 
SDELNEIITH MLNLKDYHRP SVEEILENPL lADLVSDEQR RNLEHRGRQL GEPBKSQDSS 
PVLSBLKLKE IQLQERERAL KAREERLEQK BQELCVRBRL AEDKLAEAEH LLKHYSLLKE 
RKFLSLASNP ELLNLPSSVI KKKVHFSGES KENIMRSEHS ESQLTSKSKC KDLKKRLHAA 
QLRAQALSDI EKNYQLKSRQ ILGMR 



1 I I 

CAGTCGCGCG CGGTGCAGTC GGGAGGTGGA G 
GGGTGAGGGC GCTATGGCAC CCGGCTGCAA A. 

GTCTAACCAA CCAAGTAATG AAGGTGATGC CATCftAAGTT TTTGTGCGAA T 

TGCAGAAAGA TCTGGGTCAG CTGATGGAGA GCAGAACTTA TGCTTATCTG TGCTGTCCTC 240 

CACGAGTCTC CSGCTGCACT CCAACCCTGA GCCCAAGAOC TTCACGTTTG ATCATGTTGC 300 

AGATGTGGAT ACCACTCAGG AATCTGTATT TGCAACTGTG GCTAAAAGCA TTGTGGAGTC 360

TTGCATGAGC GGTTATAATG GTACCATCTT TGCATATGGA CAGACTGGCT CAGGGAAGAC 420 

ATTTACTATG ATGGGACCAT CTGAATCTGA TAATTTTTCI CATAACCTQA GAGGAGTAAT 480 

CCCACGAAGT TTTGAATATT TGTTTTOCTT AATTGATCGT GAAAAAGAAA AGGCTGGAGC 540 

TGGRAAGAGT TTCCTTTGTA AQTGTTCCTT TATTGAAATC TACAACGAGC AGATATATGA 600 

TCTACTGGAC TCTGC»TCGG CTGGACTGTA CTTAAGG6AG CATATCAAGA AGGGAGTCTT 660 

TGTTGTTGGT GCGGTOSAGC AfiGTGGTAAC CTCAGCTGCT GAAGCCTATC AGGTGCTGTC 720 

TGGAGGATGG AGQAATAGAC QTGTQQCATC AACATCAATG AACaVGAGAAT CGTCTAGGTC 780 

TCAIGCCGTC TTTACAATTA CAATAGAGTC AATGGAfiAAA AGTAATGAGA TTGIGAATAT 840 

ACGGACCTCC CTACTCAACC TGGTGGATTT AGCAGGATCT GAAAGGCAAA AAGATACCCA 900 

TGCfifiAAGGG ATGAGATTGA AGGAAQCAGQ TAACATAAAT CGATCSTTGA GCTGCCTGGG 960 

CCAAGTGATT ACAGCACTTG TCGACGTGGG TAATGGAAAA CAGAGACATG TTTGCTACAQ 1020 

AGACTCCAAA CTTACCTTCT TACTACGGGA TTCCCTTGGA GGTAATGCC» AAACAGCCAT 1080 

AATTQCAAAT GTTCATCCTG GATCCAGGTG TTTTGGGGAA ACCCTATCAA CACTPAACTT 1140 

TGCTCAAAGA GCCAAGCTGA TTAAAARCAA GGOWSTAGTA AATGAAQACA CCCAAGGRAA 1200 

TGTGAGCXaG CTCCAAGCTG AAGTGAAGAG GCTCAAAGAA CAACTGGOSG AGCTTGCITC 1260 

AG3ACAGACA CCACCAGAAA GCTTCCTGAC CSfiAGACSiAA AAGAAGACTA ACTATATGGA 1320 

GTATTTCCAG GAAGCAATGT TATTCTTTAA GAAATCTGAA CAGQAAAAQA AGTCTCTGAT 1380 

AGAAAAAGTT ACCCAATTAG AAGACCTCAC CCTCAAAAAG GAAAAATTTA TTCAATCIAA 1440 

TAAAATGATT GTGAAATTCC GAGAGGATCA AATAATACGC TTGGAAAAGC TCCACAAGGA 1500

ATCCCGGGGA GGTTTTCTGC CTGAGGAGCA GGATCGTTTG CTCTCAGAAT TAAGGAATGA 1560 

GATTCAAACT CTGCGAGAAC AAATAGAGCA CCACCCCMA GTTGCAAAGT ATGCTATGGA 1620 

AAATCATTCC CTCAGGGAGG AGAATAGAAG ACTGAGATTA TTAGAGCCTG TGAAAAGAGC 1680 

TCAAGAAATG GATGCCCAGA CCATTGCRAA ACTAGAAAAA GCTTTCTCTG AAATAAGTGG 1740 

A AGTGACAAAA ATCAGCAAGG ATTTTCRCCT AAAGCTCAGA AAGAGCCATG 1800 

A AACACTGAGA AGT7AAAAGC AC.AACTCCTG CflAATTCAGA CAGAGCTGAA 1860

G CAAGAATATG AAGAATTCAA AGAACTTACT AGGAAAAGGC AGCTAGAATT 1920 

G CTTCAGTCTT TGCAAAAAGC GAACCTTAA7 CT7GAAAACC TTTTGGAAGC 1980 

AACAAAAGCC TGCAAGCGGC AAGAAGTTTC TCAGCTGAAT AAAATTCATG CTGAAACACT 2040 

TAAGATTATA ACTACACCAA CCAAGGCCTA CCAACTTCAT TCCCGACCAG TACCAAAATT 2100 

AAGCCCTGAA ATGGGAAGCT TTGGCTCTCT ATACACTCAG AATTCTAGCA TATTAGATAA 2160 

TGATATATTA AATGAGCCAG TTCCTCCTGA GATGAATGAA CAAGCTTTTG AGGCCATTTC 2220 

T AGAACAGTGC AGGAACAAAT GAGTGCTCTT CAAGCCAAAC TGGATGAAGA 2280 

riAAAGC TTCAGCAGCA TGITGACAAA CTGGAACATC ATTCTACCCA 2340 

3 CTTTTCTCAT CAGAAAGAAT TGATTGGACC AAACAGCAGQ AAGAGCTTCT 2400 

CTCACAGTTG AATGTCCTTQ AAAAGCAQCT TCAAGAGACT CAAACTAWi ATGACTTTTT 2460 

GAAAAGTGAG GTACATGACC TGCGAGTAGT CCTTCATTCT GCTGACAAGG AGC7TTCTTC 2520 
AGTGAAATTG GAATATAGTT CATTCSUkAAC G 
TGAAAGACAC ATGCATGTAC AGCTTCAATT AGATAATCTC A 

GCTTGAGAGC AAAGCCTGCC TACAGGATTC CTATGACAAC TTACAAGAAA TAATGAAATT 2700 

TGAGATTGAC CAACTTTCAA GAAACCTCCA AAACTTCAAA AAAGAAAATG AAACTCTGAA 2760 

ATCTGATCTG AATAATTTGA TGGAGCTTCT TQAGGCAGAA AAAGAACGCA ATAACAAATT 2820

ATCATTACAG TTTGAAGAAG ATAAAGAAAA CAGTTCTAAA GAAATCTTAA AAGTTCTTGA 2880 

GGCTSIACGI CAGGAGAAAC AGAAAGAfiAC GGCCAAGTGT GAGCAGCAGA TGGCAAAAGT 2940

ACAGAAACTA GAAGASAGCT TGCTTGCTAC TGRAAAAGTG ATCfiGTTCCC TGGAAAAGTC 3000 

TAGAGATTCr GATAAGAAAG TTGTAGCTGA CCTCATGAAC CaGATCCSGG AGCTAAGAAC 3060 

ATCGGTCTOI GAGAAAAC»fi AAACTATAGA CaCCCIGAAA CAAGAACTGA AGGACATAAA 3120 

TTGCAAATAC AACTCTGCTT TGaiTGACftG AGAAGAGAfiC AGAGTGTTGA TCAAGAAGCa 3180 

GGAAGTGGAT ATTCTGGATC TGAAAGAAAC CCTTAGGCTG AGAATACITT CTGAGQACAT 3240 

AGAGftGGGAT ATGCTCTGTG AGOACCTGGC TCATGCCACI GAGCAGCTGA ACATGCTCAC 3300 

AGAGGCCTCA AAAAAACACT CGGGGCTGCT GCSGTCTGCC Ca«3GAAGAAC TGACCAAGAA 3360 

GGAAGCCCTG ATTCAGGAAC TTOiGCACAA GCTAAACCAA AAGAAAGAGG AAGTAGAACA 3420 

GAAGAAGAAT GAATATAACT TCAAAATGAG GCAACTAGAA CATGTGATGG ATTCTGCTGC 3480 

TGAGGATCCC CAGAGTCCTA AGACACCACC TCACTTTCAA ACACATTTGG CAAAACTCCT 3540 

GGAAACACAA GAACAAGAGA TAGAAGATGG AAGAGCCTCT AAGACTTCTT TGGAACACCT 3600 
G CTABATGAAG ftCAGAGAAGT CAAAAATGCT GAAATCCTCA GAATGAAGGA 3660
GCAGTTGCGT GRAATGGAAA ACCTACGCCT GGSAAGTCAG CACTTAATAG A 
GCTCCTGCAA GGTCAGCTGG ATGATATTAA AAGACAAAAG GAAAACASTG A 
TCCAGATAAT CAACAGCTGA AGAATGAACA AGAAGAAAGT A 



G AATGACTGAT GAAGTCGAAC GAACCCAAAC 3960 

C AGGAAAAAGA ACAACTGAGA TCAAAGCTGG AAGAAATGTA 4020 

T CCCAGGAGAT GGAAATGTTA AGGAAGCRGG TGGAGIGTCT 4080 

TGCTGAGGAA AATGGAAAGT TGGTAGGTCA CCAAAATTTG CATCAGAAGA TTCAGTACGT 4140 

AGTGCXaCTA AAGAAGGAAA ATGTCAGGCT TGCTGAGQAG ACAGAAAAGT TGCGTGCXIGA 4200 

AAATGTATTT TTAAAAGAAA AGAAAAGAAG TGAATCTTGA GGATTCCGGT CAGCTACCTA 4260 

GGCATCACCT TGITTGAAGA TGTTTCTTCT CTTTTACAAG TAAGACCTAC TCCTGGCXAC 4320 

TTAGGAGAfiC TGAATTTATG GACCTTAATT ATTAAATGTT TATAAGGTGG TGGTAACCAC 4380 

CTCAAGTTTC TGATGAACAT TCTGCATCCA TATACACCCT GTGACAfiTCA GCAGTCTGCI 4440 

ATTAAGTGGC CTACTTCAAG GCTTTGAATC AACTTAAGGG AAAACCTTTT GTCTTTGTAA 4500 

AAATAAAAGC CTGTAfiCTAA GCTTTACaCT GGACATTAGC CAGATCATTT TCTTCTTAGA 4560 

TTATGCCATA ATCTCCTTTG ATTCTTATGG AAGTTCTAAC AATATATGGT GQTTCCAACA 4620 

CCTGCASTGa GTTTAATGAC TGACTTAGTA GCAGGTACAA GAAGCAAACT TGTTAATATA 4680 



1 11 21 31 41 51 

I I I I. I I 

MAPGCKTELR SVTNGQSNQE SNEGDAIKVF VEIREPAERS GSftDGEQNLC LSVLSSTSIiR 60 

LHSNPEPKTF TFDHVADVDT TQESVFATVA KSIVESCMSG YNGTIFAYGQ TGSGKTFTMM 120 

GPSBSDNFSH NLRGVIPRSF EYLFSLIDRE KEKAGAGKSF LCKCSFIEIY NEQIYDLLDS 180 

ASAGLYLREH IKKGVFWGA VEQWTSAAE AYQVLSGGWR NRRVASTSMN RESSRSHAVF 240 

TITIESMEKS NEIVNIRTSL LNLVDLAGSE RQKDTHASGM RLKEAGNINR SLSCLGQVIT 300 

ALVDVGNGKQ RHVCYRDSKL TFLLRDSLGG NAKTAIIANV HPGSRCFGET LSTLNFAQRA 360 

KLIKNKAWK EDTQGNVSQL QAEVKRLKEQ LAELASGQTP PESFLTRDKK KTNYMEYFQE 420 

AMLFPKKSEQ EKKSLIEKVT QLEDLTLKKE KFIQSNKMIV KFREDQIIRL EKLHKESRGG 480 

FLPEEQDRLL SELRNEIQTL REQIEHHERV AKYAKENHSL REENRRLRLL EPVKRAQEMD 540

AQTIAKLEKA PSEISGMEKS DKNQQGFSPK AQKBECLFAN TEKLKACLLQ IQTELNNSKQ 600 

EYEEFKELTR KRQLELESEL QSLQKANLNL ENLLEATKAC KRQEVSQLNK IHAETLKIIT 660 

TPTKAYQLHS RPVPKLSPBM GSFGSLYTQN SSILDNDILN EPVPPEKNEQ AFSAISEELR 720 

TVC3EQMSALQ AKLDEEEHKN LKLQQHVDKL EHHSTQMQEL FSSERIDWTK QQEELLSQLN 780 

VLEKQLQETQ TKMDFLKSEV HDLRWLHSA DKELSSVKLE YSSFKTNQEK EFNKLSERHM 840 

HVQLQLDNLR LENEKLLESK ACLQDSYDNL QEIMKFEIDQ LSENLQNFKK ENETLKSDLN 900 

NLMELLEAEK ERNNKLSLQF EEDKENSSKE ILKVLEAVRQ EKQKBTAKCE QOMAKVQKLE 960

ESLLATEKVI SSLEKSRDSD KKWADLMNQ IQELRTSVCE KTETIDTLKQ ELKDINCKYM 1020 

SALVDREBSR VLIKKQEVDI LDLKETLRLR ILSEDIERDM LCEDLAHATE QLNMLTEASK 1080 

KHSGLLQSAQ EELTKKEALl QKLQHKLNQK KEEVBQKKNB YNFKMRQLEH VMDSAAEDPQ 1140 

SPKIPPHPQT HIAKLLETQE QEIEDGRASK TSLEHLVTKL NEDEEVKNAE ILHMKEQLRE 1200 

MENIiRLESQQ LIEKNWLLQG QLDDIKRQKE NSDQNHFDNQ QLKNEQEESI KERLAKSKIV 1260 

EEMLKMKADL EEVQSALYNK EMBCLRMTDB VBRTQTLESK AFQBKEQLRS KLBBMYBBRB 1320 

RTSQEMEMLR KQVECIAEEN GKLVGHQNLH QKIffYWRLK KENVRLAEET EKLHAENVFL 1380 



Seq ID NO: 33 DNA sequence

Nucleic Acid Accession #: BC000633.1

Coding sequence: 1..2574 

1 11 21 31 41 51 

I I I I I I 

ATGGAATCCG AGQATTTAAG TGOCAGAfiAA TTGACAATTG ATTCCATAAT GAACaAAGTG 
AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACT6 ATGAACTAAG CTTGAAIAAA 
ATTTCTGCTG ATACTACMA TRACTCGGGR ACTGTTAACC AAATTATGAT GATGGCAAAC 
AACCCAGAGG ACTGGTTGAG TTTGTTCCTC AAACTAGAGA AAAACACTGT TCCGCTAAGT 
GATGCTCTTT TAAATAAATT GATTGQTCGT TACAGTC3UVG CSiATIGAAGC GCITCCCCCA 
GATAAATATG GCCAAAATGA GAGTTTTGCT AGAATTCRAG TGAGATTTGC TGAATTAAAA 
GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCSiAG 
AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 
AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 
GAAATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 
AAGAATTTAT CAGCATCTAC GGTATTAACT GCKCAAGAAT CATTTTCCGG TTCACTTGGG 



C ACCACAAGAT GCAGAAATAG GTTACCGGAA T 
CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCT7CTAAAT 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 
TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGTC A7TTCAAGGA 
TCAGATGAAA AGAGTTCTGA ACTTATTATT ACTGATTCAA TAACCCTGAA 
GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC 
GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 
GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC 
AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA 
ACATCTAAAT GGTTTGACCC fiAARTCTATT TGTAAGACAC CAAGCAGCAA 
GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 
TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCX 
ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG I 
AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAG3AAGTG 
CAGGIGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT 
GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 
CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 
GTAATGGflGT GTGGBAATAT TGATCITAAT AGTTGGCTTA AAAAGRAAftA ATCCATTGAT 1860

CXaTGGGAAC GCAAGAGTTA CIGGASAAAT ATGTTAGSGG CAGTTCACAC AATCCATCAA 1920 

CATGGCSVITG TTCACAGTGA TCTTAAACCa GCTAACTTTC TGATSGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCftA ATGCAACCAG ATACAACaftG TGTTGTTAAA 2040 

GATTCTC31GG TTGGCRCRGT TAATTATATG CCSiCCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TIGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACAIGCCAT AATTGATCCT AATCATGAAA TTGAATTTCC CGATATTCCa 2280 

GAGAAAGATC TTCAAGATGT GTTAAAGTGI TGTTTARAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGTTAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCKAACTTGT TGGTCTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 
AGTCATAATT CTTCATCCTC- CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 



MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKKSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 

AIQEPDDAED YFQMRHANCK KFAPVHISFA QFELSQGKVK KSKQLLQKRV 2RGAVPLEML 180 

EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNEITOSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SWPCFMKRQ 300 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKBPLV SDEKSSELII TDSITLKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 420 

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKKDFPPACQ 480 

LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNBKKQIY AIKYVNLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSWK DSQVGTVljYM PEEAIKDMSS SRENGKSKSK ISPKSDVWSL 720 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHBIBFPDIP BKDLQDVLKC CLKRDPKQRI 780 

SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVI/SQLVGIiN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF BKKRGKK 

Seq ID NO: 35 DNA sequence

Nucleic Acid Accession #: NM_005823.2

Coding sequence: 85..1953

1 11 21 31 41 51 

IGGCCACTCC CGTCTGCTGT GACGCGCGGA CAGAGAGCTA CCGGTGGACC CAOSGTGCCT 60 

CCCTCCCTGG GATCTACACA GACCATGGCC TTGCCAACGG CTCGACCCXIT GTTGGGGTCX: 120 

TGTGGQACCC CCGCCCTCGG CAGCCTCCTG TTCCTGCTCT TOWXXITCGG ATGQGTGCAG 180 

CCCTOSAGGA CCCTGGCTGG AGAGACAGGG CAGGAGGCTG CACCCCTGGA CGOAGTCCTQ 240 

GCCAACXXy^C CTAACATTTC CAGCCTCICX: CCTCGCCAAC TCCTTGGCTT CCCGTGTGCG 300 

GAGQTGTCCG GCCTGAGCAC GGAGOGTGTC CGGGAGCTGG CTGTGGCCTT GGCACAGAAG 360 

AATGTCAAGC TCICAACAGA GCAGCTGOSC TGTCTGGCTC ACOGGCTCTC TGAGCCCCCC 420 

GAGGACCTGG AOSCCCTCCC ATTGGACCTG CTGCTATTCC TCAACCCAGA TGCX3TTCTCG 480 

GGGCCCCAGG CCTGCACCCG TTTCTTCICC OGCATCACGA AGGCCAATGT GGACCTGCIC 540

CCGAGGGGQG CTCCCGAGCG ACAGCGGCTG CTGCCTGCGG CTCTGGCCTG CTGGGaTGIG 600 

CGGGGGTCTC TGCTGAGCGA GGCTGATGTG CGGGCTCTGG GAGGCCTGGC TTGOSACCIG 660 

CCTGOGCGCT TTGTGQCCQA GTCX3GCCGAA GTGCTGCTAC CCXX3GCTGGT GAGCTGCOCG 720 

GGACCCCTGG ACCAGGACCA GCAGGAGGCA GCCAGGGCGG CTCTGCAGGG CGGGGGACCC 780 

CCCTACGGCC CCCCGTCGAC ATGGTCTGTC TCCAOGATGG ACX3CTCTGCG GGGCCTGCTG 840 

CCCGTGCTGG GCCAGCCCAT CATCCGCAGC ATCCCGCAGG GOITOGTGGC CGOSTGGaSG 900 

CAACGCTCCT CTCGGGACCC ATCCTGGCGG CAGCCTGAAC GGACCATCCT CCGGCCGOGG 960 

TTCCGGCGGG AAGTGGAGAA GACAGCCTGT CCTTCAGGCA AGAAGGCCCG CX3AGATAGAC 1020 

A TCTTCTACAA GAAGTGGGAG CTGGAAGCCT GCGTGGATGC GGCCCTGCTG' 1080 

T GAACGCCATC CCCTTCACCT ACGAGCAGCT GGACGTCCTA 1140 
:CCACAA GGTTACCCCG AGTCTGH 
GGCTACCTCT TCCTCAAGAT GAGCCCTGAG GACATTCGCA AGTGGRATGT G 
GAGACCCTGA AOGCTTTGCT TGRAGTCAAC AAAGGGCACG AAATGAGTCC TCAGGTGGCC 
ACCCTGATCG ACCGCTTTGT GAAGGGAAGG GGCCAGCTAG ACAAAGACAC C 
CTGACCGCCT TCTACCCTGG GTACCTGTGC TCCCTCAGCC CCGAGGAGCT G 
CCCCCCAGCA GCATCTGGGC GGTCAGGCCC CAGGACCTGG ACACGir"*- " 

CTGGACGTCC TCTATCCCAA GGCCCGCCTT GCTTTCCAGA ACATGAACGG GTCCGAATAC 1560 

TTCGTGAAGA TCCAGTCCTT CCIGGGTGGG GCCCCCACGG AGGATTTGAA GGCGCTCAGT 1620 

CAGCAGAATG IGAGCATGGA CTTGGCCACG TTCATGAAGC TGCGGACGGA TGCGGTGCTG 1680 

CCGTTGACTG TGGCTGAGGT GCAGAAACTT CTGGGACCCC ACGTGGAGGG CCTGAAGGCG 1740 

GAGQAGCGGC ACCGCCCGGT GCGGGACTGG ATCXTTACGGC AGCGGCAGGA CGACCTGGAC 1800 

ACGCTGGGGC TGGGGCTACA GGGCGGCATC CCCAACGGCT ACCTGGTCCT AGACCTCAGC 1860

GTGCAAGAGG CCCTCTCGGG GACGCCCTGC CTCCTAGGAC CTGGACCTSr TCTCACCGTC 1920

CTGQCACTGC TCCTAGCCTC CMCCTGGCC TGAGGGCCCC ACTCCCTTGC TGGCCCCAGC 1980 

CCTGCTGGGG ATCCCCGCCT GGCCAGGAGC AGGCACGGGT GATCCCCGTT CCACCCCAAG 2040 
AGAACECGCG CTCAGTAAAC GGGAACATGC CCOCTGCAGA CACGT 



DLLLFIiNPDA FSGPQACTRF FSEITKAHVD LLPRGAEERQ RLLPAALACW GVRGSLLSEA 
DVRALGGLAC DLPGRFVAES AEVLLPRLVS CPGPLDQDQQ EAAEAALQGG GPPYGPPSTW 
SVSTMDALRG LLPVLGQPII RSIPQGIVAA WRQRSSRDPS WRQPERTILR PRFRREVEKT 
ACPSGKKARE IDESLIFYKK WBLEACVDAA LLRTQMDRVN AlPFTYEQLD VLKHKLDELY 
PQC3YPBSVIQ HLGYLFLKMS PBDIRKWNVT SLBTLKALLE VNKGHEMSPQ VATLIDRFVK
GRGQLDKDTL DTLTAPYPGY LCSLSPEELS SVPPSSIWAV RPQDLDTCDP RQUJVLYPKA 
lUAFQHMNGS EYFVKIQSFL GOAPTEDLKh LSQQNVSHDL ATFHKLRTDA VLPLTVAEVQ 
KLLGPHVEGL KAEBRHRPVR DWILRQRQDD LDTLGLOLQG GIPHGYLVLD LSVQEAWGT 
PCLLGPGPVL TVLAIiLLAST LA 



A CTCCCGTCTG CTGTGACGCG CGGACAGAGA GCTACCGGTG G 
GCCTCCCTCC CTGGGATCTA CACAGACCAT GGCCTTGCAA CGGCTCGACC CCTGTTGGTC 
CTGTGGGGAC CGCCCTGGCA GCCTCCTGTT CCTGCTCTTC AGCCTCGGAT GGGTGCaTCC 



AACCCCCCAT AACATTTCCA GCCTCTCCCC TCGCCAACTC C 



GGTGTCCGGC CTGAGCACGG AGCGTGTCCG GGAGCTGGCT GTGGCCTTGG CACAGAAGAA 360
TCAACAGAGC AGCTGCGCTG TCTGGCTCAC OSGCTCTCTG AGCCCCCCGA 420
GCCCTCCCAT TGGACCTGCT GCTATTCCTC AACCCAGATG CGTTCTCGGG 480
GCCCCAGGCC TGCACCCGTT TCTTCTCCCG CATCACGAAG GCCAATGTGG ACCTGCTCCC 540
GAGGGGGGCT CCCGAGCGAC AGCGGCTGCT GCCTGCGGCT CTGGCCTGCT GGGGTGTGCG 600
GGGGTCTCTG CTGAGCGAGG CTGATGTGCG GGCTCTGGGA GGCCTGGCTT GOGACCTGCC 660
TGGGCGCTTT GTGGCCGAGT CGGCCGAAGT GCTGCTACCC OGGCTGGTGA GCTGCCCGGG 720
ACCCCTGGAC CAGGACCAGC AGGAGGCAGC CAGGGCGGCT CTGCRGGGC3G GGGGACCCCC 780
CTACGGCCCC CCGTCGACAT GGTCTGTCTC CACGATGGAC GCTCTGCGGG GCCTGCTGCC 840
CGTGCTGGGC CAGCCCATCA TCCGCAGCAT cdCGCAGGGC ATCX3TGQCCG CGTGGOSGCA 900
ACGCTCCTCT CGGGACCCAT CCTGGCGGCA GCCTGAACGG ACCATCCTCC GGCCGOSGTT 960

CCGGCGGGAA GTGGAGAAGA CAGCCTGTCC TTCAGGCAAG AAGGCCCGCG AGATAGAOGA 1020 

GAGCCTCATC TTCTACAAGA AGTGGGAGCT GGAAGCCTGC GTGGATG03G CCCTGCTGGC 1080 

CACCCAGATG GACCGCGTGA ACGCCATCCC CTTCACCTAC GAGCRGCTGG ACGTCCTAAA 1140

GCATAAACTG GATGAGCTCT ACCC»CAAGG TTACCCCGAG TCTGTGATCC AGCACCTGGG 1200 

CTACCTCTTC CTCAAGATGA GCCCTGAGGA CATTCGCAAG TGGAATGTGA CGTCCCTGGA 1260 

GACCCTGAAG GCTITGCTTG AAGTCGACAA AGGGCACGAA ATGAGTCCTC AGGCTCCTOG 1320 

GOSGCCCCTC CCRCAGGTGG CCACCCTGAT CGACCGCTTT GTGAAGGGAA GGOGCCAGCT 1380 

AGACRftAGAC ACCCTAGRCA CCCTGACCGC CTTCTACCCT GGGTACCTGT GCTCCCTCM«3 1440

CCCCGAGGAG CTGBGCTCCG TGCCXX:CC3K3 CAGCATCTGG GCGGTCAGGC CCCAGGACCT 1500 

GCiACACGTCT GACCCAAGGC SCICTGGACGT CCTCTATCCC AAGGCCCGCK TTGCTTTCCA 1560

GAACATGAAC GGGICCGAAT ACTTCGTGAA GATCCAGTCC TTCCIGGGIG GGGCCCCCM«: 1620 

GGAOSATTTG AAGGCGCTCA GTCAGCAGRA TGTGAGCATG GACTTGGCXa CGTTCATGAA 1680 

GCTGOSGAaS GATGOGQIGC TGCCGTTGAC TGTGGCTGAG GTGCAGAAAC TTCTGGGACC 1740

CXaCGTGGAG GGCXTOAAGG CX3GAGGAGCG GCACOSCCCG GTGCGGGACT GGATCCTAO; 1800 

GCAGCGGCAG GACGACCIGG ACAOGCTGGG GCTGGGGCTA CAGGGCGGCA TCCCCAACX3G 1860 

CTACCTGGTC CTAGACCTCa GOSTGCAAGA GACCCTCTCG GGGACGCCCT GCCTCCTAGG 1920 

ACCTGGACCI GTTCTCACCX3 TCCTGGCACT GCTCCTAGCC TCCaCCCTGG OCTGAGGGCC 1980 

CCACTCCCTT GCTGGCCCCA GCCCTGCTGG GGATCKCXMC CTGGCCAGGA GCAGGCACGG 2040

GTGATCCCCG TTCXaCCCCA AGAGAACTOG CGCTCaGTAA ACGGGAACAT GCCCCCTGCa 2100 



| | | | | |

MALQRLDPCW SCGDRPGSLL FliliFSLGHVH PARTLAGETG TESAPLGGVL T— 
PRQLLGFPCA BVSGLSTERV RBIAVALAQK JJVKLSTEQLR CLAHRLSEPP EDLDALPLDL 
LLFLNPDAFS GPQACTRFFS RITKANVDLL PRGAPERQRL LPAALACWGV RGSLLSEADV 
RALGGLACDL PGRFVAESAB VLLPRLVSCP GPLDQDQQEA ARAALQGGGP PYGPPSTWSV 
STMDALRGLL PVLGQPIIRS IPQGIVAAWR QRSSRDPSWR QPERTILRPR FRRBVEKTAC 
PSGKKAEEID ESLIFYKKWE LEACVDAALL ATQMDRVNAI PFTYEQLDVL KHKLDELYPQ 
GYPBSVIQHL GYLFLKMSPE DIRKWMVTSL BTLKALLEVD KGHEMSPQAP RRPLPQVATL 
IDRFVKGRGQ LDKDTLDTLT AFYPGYLCSL SPBBLSSVPP SSIWAVRPQD LDTCDPRQLD 
VLYPKARLAF QNMNGSEYFV KIQSFLGGAP TEDLKALSQQ HVSMDLATFM KLRTDAVLEL 
TVABVQKLLG PHVEGLKAEE RKRPVRDWIL RQRQDDLDTL GLGLQGGIPM GYLVLDLSVQ 



| | | | | |

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TC^TTGATCA CAGTCATGTC 
CCaSAGTTTG AGGTGQCCAC CTGGATCRAA ATCACCCTTA TTCTGGTGTA CCTGATCAIC 
TTOGTGATCG GCXJTTCrGGG GAACAGCGTC ACCSTTOGGG TCACCCAGGT GCTGCAGAAG 
AAAGGATACT TGCXGAAGGA GGTGACaGAC CACaTGGTGA GTTTGGCTTG CTCGGACATC 
TTGQTGTTOC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 
ACGTCtaiGCT ACACCCTGTC CTGCAAGCTG CACaCTTTCC TCTTCGAGGC CTGCAGCTAC 
GCTACGCTGC IGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCaCCCC 
TTCAGGTACA SGGCTGTGTC GGGACCTTGC CaGGTGASGC TGCTGATTOG CTTCGTCTGG 
GTCACCTCCX3 CCCTGGTGGC ACTGCCXTTTG CTGTTTG<XA TGGGTACTGR GTACCCCCIG 
GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCaCCCG CCACCACGRG 
CAGCCCGAGA CCTCCfiATAT GTCCATCTGT ACCSVACCTCT OCAGOXiCrG GACOGTGTTC 
CAGTCCAGCA TCTTOSGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCITC 
ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCBGA AGGGCTCGCT GGCXX3GGGGC 
ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 
ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 
ATTCGGSGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC CTACTTCCGG 
GCCTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCaGCTC GGTCATC3UVC 1020

CCGCTCCTGT AC&CX3GTGTC CTCXSCRGCftG TTTCGGCSGG TGTTCGTGCa GGTGCTGTCC 1080 

TGCJCGCCTGT CGCTGCAGCa CGCCAACCAC GAGAAGCGCC TGCXKX3TAC» TGOSCACTCC 1140 

AOaCCGACA GCGCCCX3CTT TGTGCAGCGC COGTTGCTCT KIGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCaCTTTTC AGAGOSAGGC CGAGCCCCWG 1260

TCTAAGTCCC AGTCRTTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAfiCCAGCC 1320 
AATTCTGCTG CAGAGRATGG TTTTCAGGAG CATGAAGTTT GA 

Seq ID NO: 40 Protein sequence 
Protein Accession #: NP_001499.1

| | | | | |

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGIiGHSV TIKVTQVUJK 60

KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFERCSY 120 

ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180

VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFW YLWLLSVAF 240 

MCWHMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRLIW TLAVCWMPNQ 300 

IRRIMABAKP KHDWTRSYFR RYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360 

CRLSLQHANH EKRLRVHAHS TTDSARPVQR PLLFASRRQS SARRTEKIFL STFQSSAEPQ 420 

SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV






| | | | | |

GGAGCGCGCG GTCCGGGCAC ACGGAGCAGG TTGGGACCGC GGCGGGTACC G 
CGCCATGCGG AGGCCGAGCG TGCGCGCGGC C 

GCTGQTGGGC GCTGCTGTCT TCGACGCGCT CGAGTCCGAG GCGGAAAGCG GCCGCCAGCG 180 

ACTGCTGGTC CAGAAGCGGG GCGCTCTCCG GAGGAAGTTC GGCTTCTCGG CCGAGGACTA 240 

CCGCGAGCTG GAGCGCCTGG CGCTCCAGGC TGAGCCCCAC CGCGCCGGCC GCCAGTGGAA 300 

GTTCCCCGGC TCCTTCTACT TCGCCATCAC CGTCATCACT ACCATCGGOT ACGGCCACGC 360 

CGCGCCGGGT ACGGACTCCG GCAAGGTCTT CTGCATGTTC TACGCGCTCC TGGGCATCCC 420 

GCTGACGCTG GTCACTTTCC AGAGCCTGGG CGAACGGCTG AACGCGGTGG TGCGGCGCCT 480 

CCIGTTGGCG GCCRAGTGCT GCCrGGGCCT GCGGTGGACG TGCGTGTCCA CGGAGAACCT 540 

GGTGGTGGCX: GGGCTGCTGG CGTGTGCCGC CaCCCTGGCC CTCGGGGCCG TCGCCTTCTC 600 

GCACTTCGAG GGCTGGACCT TCTTCCaCGC CTACTACTAC TGCTTCATCA CCCTCACCAC 660 

CATOGGCTTC GGCGACTTCG TGGC»CTGCA QftSCGGCGAG GOGCTGOiGA GGAAGCTCCC 720 

CTAOGTGGCX: TTCAGCTTCC TCTACATCCT CCTGGGGCTC AOGGTCATTG GCGCCITCCT 780 

CAACCTGGTG GTCCTGCGCT TCCTCGTTGC CaGOGCCGAC TGGCCCGftGC GCGCTGCCCG 840 

CCCCCCCAGC CCGOGCCCCC CXSGGGGCGCK CGASAGCCGT GGCCTCTGGC TGCCCCGCCG 900 

CCCGGCCCGC TCCGTGGGCT CCGCCTCTGT CTTCTGCCAC GTGCACAAGC TGGAfiRGGTG 960 

CGCCCGCGAC AACCTGGGCT TTTOGCCCXIC CTCXSAGCCCG GGGGTCGTGC GTGGOSGGCA 1020 

GGCTCCCaGG CCTGGGQCCC GGTGGAAGTC CaTCTGACAA CCCCACCCAG GCC&GGGTCG 1080 

AATCTGGAAT GGGAGGGTCT GGCTTCAGCT ATCAGGGCAC CCTCCCCaGG GATTGGAAAC 1140 

GGATGACGGG COTCTAGGCG GTCTTCTGCC ACXSAGCAGTT TCTCATTACT GTCTGTGGCT 1200 

AAGTCCCCTC CCTCCTTTCC AAAAATATAT TACAOTCaCa CCATAAAAAA AAAAAAAAAA 1260 
AAAAAAAAAA A 



| | | | | |

MERPSVRAAG LVLCTLCYLL VGAAVFDALE SEAESGRQRL LVQKRGALRR KFGFSAEDYR 
ELERLALQAE EHEAGRQWKF PGSFYFAITV ITTIGYGHAA PCTDSGKVFC MFYALLGIPL 
TLVTFQSU3E RLHAWRELL LAAKCCLGLR WTCVSTENLV VAGLLACAAT LALGAVAFSH 
FEGWTFFHAY YYCFITLTTI GFGDFVALQS GEALQRKLEY VAFSFLYILL GLTVIGAFLN 
LWLRFLVAS ADWPERAARP PSPRPPGAPE SRGLWLPRRP A ' 
RDNLGFSPPS SPGWRGGQA PRPGARWKSI 



1 11 21 31 41 51

| | | | | |

^ TCCAGCTGAA GGACTGATTG CAGGAAAACT TGGCAGCTCC CCAACCTTGG 60 

TGGCCCAGGG AGTGTGAGGC TGCAGCCTCA GAAGGTGTGA GCAGTGGCCA CGAGAGGCAG 120

GCTGGCTGGG ACATGAGGTT GGCAGAGGGC AGGCAAGCTG GCCCTTGGTG GGCTTCGCCC 180

TGAGCACICG GAGGCACTCC TATGCTTGGA AAGCTCGCTA TGCTGCTGTG GGTCCAGCAG 240 

GCGCTGCTCG CCTTGCTCCT CCCCACACTC CTGGCACAGG GAGAAGCCAG GAGGAGCCGA 300 

AACACCACCA GGCCCGCTCT GCTGAGGCTG TCGGATTACC TTTTGACCAA CTACAGGAAG 360

GGIGTGCGCC CCGTGAGGGA CTGGAGGAAG CCAACCACCG TATCCATTGA CGTCATTGTC 420 

TATCCCATCC TCAACGTGGA TGAGAAGAAT CAGGTGCTGA CCACCTACAT CTGGTACCGG 480 

CftGTACTGGA CTGATGAfiTT TCTCCaGTGG AACCCTGAGG ACTTTGACAA CATCACCAAG 540 

TTGTCCRTCC CCACGGACAG CaTCTGGGTC CCGGACATTC TCATCAATGA GTTCGTGGAT 600 

GTGGGGAAGT CTCCAAATAT CCCGTACGTG TATATTCGGC ATCAAGGCGA AGTTCAGAAC 660 

TACAAGCCCC TTCSfflGTGGT GACTGCCTHT AGCCTCGACA TCTACAACTT CCCCTTCGAT 720 

GTCCAGAACT GCTOSCTGRC CTTCACCACT TGGCTGCACA CEATCCAGGA CATCAACATC 780 

TCTTTGTGGC GCTTGCCAGA AAAGGTGAAA TCCGACSiGGA GTGTCTTCM GRACCAGGGA 840 

GAGTGGGAGT TGCTGGGGGT GCTGCCCTAC TTTOGGGAGT TCAGCATGGA AAGCAGTAAC 900 

TACTATGCAG AAAIGAAGTT CTATGTGGTC ATCOGCCGGC GGCCCCTCTT CTATGTGGTC 960 

AGCCTGCTAC TGCCCASCAT CTTCCTCATG GTCATGGACA TCGTGGGCTT CTACCTGCCC 1020 

OCCAACAGTG GCGAGAGGGT CTCTTTCAAG ATTACACTCC TCCTGGGCTA CTCGGTCTTC 1080 

CTGATCATCG TTTCTGACAC GCTGCCGGOC ACTGCCATCG GCACTCCTCT CATTGGTGTC 1140 

TACTTTGTGG TGTGCATGGC TCTGCTGGTG ATAAGTTTGG COGAGACCAT CTTCATTGTG 1200 



CGGCTGGTGC ACAAGCAAGA CCTGCRGCAG CCCGTGCCTG CTTGGCTGCG TCRCOTGGTT 1260 

CTGGAGAGAA TCGCCTGGCT ACTTTGCCTG AGGGAGCRGT CAACTTCCXm GAGGCCTCCA 1320 

GCCACCTCCC AAGCCRCCAA GACTGATGAC TGCTCAGCCA TGGGAAAC(» CTGCAGCCAC 1380 

RTGGGAGGAC CCCAGGACTT CGAGAAGAGC CCGAGGGRCA GATGTRGCTC TCCCCCRCCA 1440 

CCTCX3GGAGG CCTCGCTGGC GGTGTGTGGG CTGCTGCaGG AGCTGICCTC CATCXX3GCRA 1500

TTCCTGGAAA AGCG<3GATGA GRTCCGRGAG GTGGCCOSftG ACTGGCTGOS CGTGGGCTCC 1560 

GTGCTGGftCA AGCTGCTATT CCAC&TTTAC CTGCTAfiCGG TGCTGGCCTA CAGCATOiCC 1620 

CTQGTTATGC TCTGGTCCaT CTGGCaGTAC GCTTGAGIGG GTACRGCCCR GTGGAGGSGG 1680 

GGGTACAGTC CTGGTTAGGT GGGGACAGAG GATTTCTGCT TAGGCCCCTC AGGACCCSGG 1740 

GAATGCCRGG GACATTTTCR AGACACAGAC AAAGTOCCGT GCCCTGTTTC CARTGCCMT 1800

TCftTCTCAGC AATCftCAAGC CAAGGTCTGR ACCCTTCCAC CaAAAACTGG GTGTTCAAGG 1860 

CCCTTACaCC CTTGTCCCAC CCCCAGCAGC TCACCATGGC TTTAABACAI GCTCTCTTAG 1920 

ATCAGGAGAA ACTCGGGCaC TCCCTAAGTC CACTCTAGTT GTGGACTTTT CCCCATTGAC 1980 

CCTCACCTGA ATAAGGGACT TIGGAATTOT GCTTCTCTTT CACAACTTTG CTTTTAGGTT 2040 

GAAGGCAAAA CCAACTCTCT ACTACaCAGG CCTGATAACT CTCTAOSAGG CTTCTCTAAC 2100

CCCTAGTGTC ttTTTTTTCT TCACCTCACT TGTGGCAGCT TCCCTGAAC» CTCSTCCCCC 2160 
T GGGAGTGGGA AGAATAAAAT GCAGTGRAAC cc 



| | | | | |

MLLWVQQALL ALLLPTLLRQ GEARRSRNTT RPALLRLSDY LLTNMiKGVR PVRDWRKPTT 
I LNVDEKNQVL TTYIWYRQVW TDEFLQWNPE DFDHITKLSI PTDSIWVPDI 
1 HQGEVQNYKP LQWTACSLD IYNFPFDVQN CSLTFTSWLH 
TIQDINISLW RLPEKVKSDR SVFMNQGEWE LicVLPYFRE FSMESSNYYA EMKFYWIRR 
RPLFYWSLL LPSIFLMVMD IVGFYLPPNS GERVSFKITL LLGYSVFLII VSDTLPATAI 
GTPLIGVYFV VCMRLLVISL AETIFIVRLV HKQDLQQPVP AWLRHLVLER lAWLLCLREQ 
STSQRPPATS QATKTDDCSA MGNHCSHMGG PQDFEKSPRD RCSPPPPPRE ASLAVOSLLQ 
ELSSIRQFLE KRDEIREVAR DWLRVGSVLD KLLFHIYLLA VLAYSITLVM LWSIWQYA 

Seq ID NO: 45 DNA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241..1902

| | | | | |

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC C 
CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT ft 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGOGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCOOG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGOW CCAGTGCAAG GCATCACGGG TTGTTAGCAT OGGCRCGTCA GCCTGGGGTC 360 

TGTCACIATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACSG CAAGGGAGTC 420 

TGTGAflGCTA CATGCX5AACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCaGTCAfiG ATGTGAATGA GIGTGGAATG 540 

AAACCCCGGC CATGCCARCX CAGATGTGIG AATACACACG GAAGCTACRA GTGCTTTTGC 600 

CTCAGIGGCC ACATGCTCAT GCX»GATGCT ACGTGTGTGA ACTCTAGGRC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GRAGABGGGC CAC3«3TGCCT GTGTCCATCC 720 

TCAGGRCTCC GCCTGGCCCC AAATGGASGA GACTGICTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTC3V TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGBAG CTACTACTGC 840 

AAATGTCACa TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACIG TATAGATATA 900 

AATGAATGTA CTATGGATAfi CCATACGTGC AGCCMCaTG CCAATTGCTT CAATACCKAA 960 

GGGTCCTTCA AGTGTAAATG CaAGCAGGGA TATAASGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCXTCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CSiAAA&TTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AfiOCCTTCRA CIATGRAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAAIGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGRG AAAGCCCTGA AGRATGACAT AGAGGSGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT O 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTl 
GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA T 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740

TGGAAGACAG GOAAAATTCA GTTGTATCAA GGAACTGATG CTACCAARAG CATCATTTTT 1800 

3 CAAAACCGGC GAAAICGCAG TG3ATG3CGT CTTGCTTGTT 1860 

G CCTTTTATCT GTGGATGACT G.^ATGTTACT ATCTTTATAT 1920 

TTGACTTIGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATRGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAR ATTGTARTGT ACCAACAGA/^ ATATTATTGT AAGATGCCTT 2040

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATRTCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATCGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGARAA 2220 

TAGAAAAAAA AGCACRGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340
A AATTCTTTGT AATAATAATA TCCRAATCaT CSUVAARAAAA AAAAAAAA 



Protein Accession #: NP_056322

| | | | | |

MPLPWSLRLP LLLSWVAGGF GNAASREHHG LLASAHQPGV C 
CEATCEPGCK EGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHECV NTHGSYKCFC 
LSCamLMPDA TCVNSETCAM INCQYSCEDT EEGPQCLCPS SGLRLAPHGR DCLDIDECAS 
GKVICPrami CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHRNCFNTQ 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI [OCLLftHKNSM KKKftKIKNVT
PEPTRTPTPK VNLQPFireBE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KAUOTOIEER 
SLRGDVPFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 
IRVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 
& WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 
SGLCPDSLLS VDD 

Seq ID NO: 47 DNA sequence 



GCTCCATGGC 



TGTGCAAGAG GCTCCCACCC A 
GGAGGCGTCC TGGTCAATGA GCGCTGGGTG CTCACTGCCG CCCACTGCAA GATGAATGAG 
?GGGCAG TGATACGCIG GGCGACAGGA GAGCTCAGAG GATCAAGGCC 
T TCCGCCACCC CGGCTACTCC ACACAGACCC ATGTTAATGA CCTCATGCTC 
GTGAAGCTCA ATAGCCAGGC CAGGCTGTCA TCCATGGTGA AGAAAGTCAG GCTGCCCTCC 



GATGTGACCT TTCCCTCTGA CCTCATGTGC GTGGATGTCA A 
G TTTACAAGGA CTTACTGGAA AATTCCATGC T 
\ ACGCCTGCAA TGGTGACTCA GGGGGACCGT T 
CMGGTCTGG TGTCCTGGGG AACTTTCCCT TGCGGQCAAC CCAATGACCC AGGAGTCTAC 
T GCAAGTTCAC CAAGTG6AIA AATGACRCCA TGRAftAAGCX TCGCTAftCGC 
T TAATTAACTG TGTGCTTCCA A£JaGARRATG CaCBlGGAGIG AGGACGCCGA 
TGACCTATGA AGTCAAATTT GACTTTACCT TTCCTCAAAG ATATATTTAA ACCTCATGCC 
CTGTTGATAA AOSVATCAAA TTGGTAAftGA CCTAAAACCA AAACfiA&TAA AGRAACaCAA 
AACCCICAA 



MARSLLLPLQ IliLSLALET AGEEAQGDKI IDGAPCARGS HPWQVALLSG NQLHOGGVLV 
NBRWVLTAAH CKMNEYTVHL GSDTLGDRHA QHIKASKSFR HPGYSTQTHV NDLMLVKLHS 
QAELSSHVKK VRLPSRCBPP GTTCTVSGWG TTTSPDVTFP SDLMCVDVKL ISPQDCTKVY 
KDLLENSMLC AGIPDSKKNA CNGDSGGPLV CRGTLQGLVS WGTFPCGQPH DPGVYTQVCK 



Seq ID NO: 49 DNA sequence 

Nucleic Acid Accession #: NM_003466.1

Coding sequence: 11..1363

1 11 21 31 41 51 

| | | | | |

GAATTCGGCG AIGCCTCACA ACTCCATCAG ATCTGGCCAT GGAGGGCTGA ACCAGCTGGG 
AGGGGCCTTT GTGAATGGCA GACCTCTGCC GGAAGTGGTC CGCCAGCGCA TOGTAGACCT 
GGCCCACCAG GGTGTAAGGC CCTGCGACAT CTCTCGCCAG CTCKGCGTCA GCCATGGTTG 
CGTCAGCAAG ATCCTTGGCA GGIACTACGA GACTGGCAGC ATCCGGCCTG GAGTGATAGG 
GGGCTCCAAG CCCAAGGTGG CCACCCCCAA GGTGGTGGAG AAGATTGGGG ACTACAAACG 
T ACCATGTTTG C 



GTGCCCATTT GAGCGGCAGC ACTACCCRGA GGCCTATGCC T 

CGAGCAGGGC CTCTACCCGC TGCCCTTGCT CAACAGCACC CTGGAOSACG GGAAGGCCAC 840

CCTGACCCCT TCCAACRCGC CRCTGGGGCG CAACCTCTCG ACTCACCAGA CCTACCCCGT 900

GGTGGCAGAT CCTCACTCAC CCTTCGCCAT AAAGCAGGAA ACCCCCGAGG TGTCCAGTTC 960 

TAGCTCCACC CCTTCCTCTT TATCTAGCTC CGCCTTTTTG GATCTGCAGC AAGTCGGCTC 1020 

CG3GGTCCCG COCTTCAATG CCTTTCCCCA TGCIGCCTCC GIGTACX3GGC AGTTCAOGGG 1080 

CCAGGCCCTC CTCTCAGGGC GAGAGATGGT CGGGCCCACG CTGCCCGGAT ACCCACCOCA 1140 

CATCXKCaCC ftdCGGACAGQ GCAGCTATGC CTCCTCTGCC ATCGCAGGCA TGGTGGCAGG 1200

AAGTGAATAC TCTGGCAATG CCTATGGCCA CA(3CCCCTAC TCCTCCTACSi GCGAGGCCTG 1260 

GOSCTTCCCC RRCTCCAGCT TGCTGRGTTC CCCATATTAT TACAGITCCR GATCAAGGCC 1320 
GAGTCCRCCG CCCACCACTG CCACGGCCTT TGACCATCTG TAGTTGAAGC TT 






Seq ID NO: 50 Protein sequence
Protein Accession #: NP_003457

| | | | | |

MPHNSIRSGH GGUJQIiGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISHQ LRVSHGCVSK 
ILGRYYETGS IRPGVIGGSK PKVATPKWE KIGDYKRQNP TMFAMBIRDR LIAEGVCDND 
TVPSVSSINR IIRTiCVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPES PQSDSIiGSTY 
SINGLLGIAQ P6SDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 
ERQHYPEAYA SPSHTKGEQG LYPLPLtHST LDDGKATLTP SMTPLGRNLS THQTYPWAD 
PHSPFAIKQE TPEVSSSSST PSSLSSSAFL DLQQVGSGVP PFNAFPHAAS VYGQFTGQAL 
LSGREMVGPT LPGYPPHIPT SGQGSYASSA lAGMVAGSEY SGMAYGHTPY SSYSEAMRFP 
NSSLLSSPYY YSSTSRPSAP PTTATAFDHL 

Seq ID NO: 51 DNA sequence

Nucleic Acid Accession #: NM_013952

Coding sequence: 161..1357






1 11 21 31 41 51 

| | | | | |

TTCaCiAAGGA GGAG!«3ACRC CGC3GCCCAGG CCACCCTCGC GGGCGGGCGG ACCCAAGCAG 60 

3 CAGCCX3GCCG GCCAGCMCAG OGGCAGOCGC GGCCCGQACC TACGGGAGGA 120 

G CTGCGAGCGA CTCCCCGGCG ATGCCTCACR ACTCCATCAG 180 

R ACCAGCTGGG AGGGGCCTTT GTGAATGGCR GACTTCTGCC 240 

GGAAGTGGTC CGCCRGCGCA TCGTAGRCCT GGCCCACCAG GGTGTAAGGC CCTGCG&CaT 300 

CTCTCGCCAG CTCCGCGTCA GCCATGGCTG OGTCAGCRAG ATCCTTGGCa GGTACTACGA 360 

GACTGGCRGC ATCCGGCCTG GAGTGATAGG GGGCTCCMG CCCAAGGTGG CCACCCCCAA 420 

GGTGGTGGAG AAGATTGGGG ACTACAAACG CCAGAACCCT ACCATGTTTG CCTGGGAGAT 480 

CCGAGACCGG CTCCTGGCTG AGGGCGTCTG TGACfiATGAC ACTGTGCCCA GTGTCAGCTC 540 

CATTAATAGA ATCATCCGGA CCAAAGTGCR GCAACCATTC AACCTCCCTA TGGRCAGCTG 600 

CGTGGCCACC AAGTCCCTGA GTCCCGGACA CACGCTGATC CCCAGCTCAG CTGTAACTCC 660 

CCCGGAGTCA CCCCAGTCGG ATTCCCTGGG CTCCACCTAC TCCATCAATG GGCTCCTGGG 720 

CATCGCTCAG CCTGGCAGCG ACAAGBGGAA AATGGATGAC AGTGATCAGG ATAGCTGCCG 780 

ACTAAGCATT GACTCACAGA GCAGCAGCAG CGGACCCCGA AAGCACCTTC GCACGGATGC 840 

CTTCAGCCAG CACCACCTCG AGCCGCTCGA GTGCCCATTT GAGOSGCAGC ACTACCCAGA 900 

GGCCTATGCC TCCCCCAGCC ACACCAAAGG CGAGCAGGGC CTCTACCCGC TGCCCTTGCT 960 

CRACAGCACC CTGGACGACG GGAAGGCCAC CCTGACCCCT TCCAACACGC CACTGGGGCG 1020 

CAACCTCTCG ACTCACCAGA CCTACCCCGT GGTGGCAGCT CCGCCCTTTT GGATCTGCAG 1080

CAAGTCGGCT CCGGGGTCCC GCCCTTCAAT GCCTTTCCCC ATGCTGCCTC CGTGTACGGG 1140

CAGTTCACGG GCCAGGCCCT CCTCTCAGGG CGAGAGATGG TGGGGCCCRC GCTGCCCGC3A 1200

ACATCCCCAC CAGCGGACAG GGCAGCTATG CCTCCTCTGC CATCGCAGGC 1260

3 GAAGTGAATA CTCTQGCAAT GCCTATGGCC ACACCCCCTA CTCCTCCTAC 1320 

AGCGAGGCCT GGGGCTTCCC CSiACTCCAGC TTGCTGAGTT CCCCaTATTA TTACAGTTCC 1380 

ACATCRAGGC CGAGIGCACC GCCCSCCACT GCfCACGGCCT TTGACCaTCT GTAGTTGCCA 1440 
TGGGGACAGT G 

Seq ID NO: 52 Protein sequence 
Protein Accession #: NP_039246

1 11 21 31 41 51 

| | | | | |

MPHMSIRSGH GGLNQLGGAF VNGRPLPEW RQRIVDLAHQ GVRPCDISRQ LRVSHGCVSK 60 

ILGRYYBTGS IRPGVIGGSK PKVATPKWB KIGDYKRQNP TMFAWEIRDR LLAEGVCDHD 120 

TVPSVSSINR IIRTKVQQPF NLPMDSCVAT KSLSPGHTLI PSSAVTPPBS PQSDSLGSTY 180 

SINGLLGIAQ PGSDKRKMDD SDQDSCRLSI DSQSSSSGPR KHLRTDAFSQ HHLEPLECPF 240 

BRQHYPBAYA SPSHTKGEQG LYPLPLtHST LDDGKATLTP SNTPLGBNLS THQTYPWAA 300 

PPFWICSKSA PGSRPSMPFP MLPPCTGSSR ARPSSQGBRW WGPRCPDTHP T " 

PPLPSQAWWQ EVNTLAMEMA TPPTPPTARP GASPTPAC 



CTTGTGGTTC CTCTCTACTT GGGGAAATCA GGTGCAGCGG CCATGGCTAC AGCAAGACCC 60 

CCCTGGATGT GGGTGCTCTG TGCTCTGATC ACAGCCTTGC TTCTGGGGQT CacaGftGCAT 120 

GTTCTCGCCA ACAATGATGT TTCCTGTGAC CSVCXCCTCTA ACACCX3TGCC CTCTGGGAGC 180 

AACCAGGACC TGGGAGCTGG GGCCGGGGAA GACGCCCGGT OGGATGACAG CSGCAGCCGC 240 

ATCATCAATG GATCCX3ACTG CGATATGCAC ACCCAGCCGT GGCAGGCOGC GCTGTTGCTA 300 

AGGCCCAACC AGCTCTACTG CGGGGCGGTG TTGGTGCATC CACRGTGGCT GCTCAOGGCC 360 

GCCCACTGCA GGRRGAAAGT TTTCAGAGTC OGTCTCGGCC ACTACTCCCT GTCACCAGTT 420 

TATGAATCTG GGCRGCAGAT GTTCCAGGOG GTCAAATCCA TCCCCCACCC TGGCTACTOC 480 

CACCCTGGCC ACTCTAACGA CCTCATGCTC ATCAAACTGA ACRGRAGAAT TCGTCCCACT 540 

AAAGATGTCA GACCCATCAA CGTCTCCTCT CATTGTCCCT CTGCTGGGAC AAAGTGCTTG 600 

T GGGGGACAAC CAAGAfiCCCC CRAGTGCACT TCCCTAAGGT CCTCCAGTGC 660 

A GCGTGCTAAG TCAGAAAAGG TGCGAGGATG CTTACCCGAG RCAGATAGAT 720 

T TCTGCGCCGG TGACRAAGCA GGTAGAGACT CCTGCCAGGG TGATTCTGGG 780 

GGGCCTGTGG TCTGCRATGa CTCCCTGCAG GGACTCGTGT CCTCGGGAGA TTACCCTTGT 840 

GCCCGGCCCA ACAGACCGGG TGTCTACACG AflCCTCTGCA AGTTCACCAA GTGGATCCAG 900 

GAAACCATCC AGGCCAACTC CTGAGTCATC CCAGGACTCA GCACACCGGC ATCCCCACCT 960 

GCTGCAGGGA CAGCCCTGAC ACTCCTTTCA GACCCTCATT CCITCCCAGA GATGTTGAGA 1020

ATGTTCATCT CTCCAGCCCC TGACCCCATG TCTCCTGGAC TCAGGGTCTG CTTCCCCCAC 1080 

ATTGGGCTGA CCGTGTCTCT CTAGTTGAAC CCTGGGAACA ATTTCCAAAA CTGTCCAGGG 1140 

CGGGGGTTGC GTCTCAATCT CCCTGGGGCA CTTTCATCCT CAAGCTCAGG GCCCRTCCXT 1200 
TCTCTGCAGC TCTGACCCAA ATTTAfiTCCC AGAAATAAAC TGftGAAGTGQ AAAAAAAARA 

Seq ID NO: 54 Protein sequence
Protein Accession #: NP_036559

1 11 21 31 41 51 

| | | | | |

MATARPPWMW VLCALITALL LGVTBHVIiAN NDVSCDHPSN TVPSGSNQDL GAGftGBDARS 60 

DDSSSRIING SDCDMHTQPW QAALLLRPNQ LYCGAVLVHP QWLLTAAHCR KKVFRVRLGH 120 

YSLSPVYESG QQMFQGVKSI PHPGYSHPGH SHDLMLIKLK RRIRPTKDVR PIJWSSHCPS 180 

AGTKCLVSGW GTTKSPQVHF PKVLQCIjNIS VLSQKRCEDA YPRQIDDTMF CAGDKAGRDS 240 
OQGDSGGFW OJGSLQGLVS WGEYPCARPH RPGVYTNLCK FTKWIQETIQ SNS 

Seq ID NO: 55 DNA sequence



CCCAGAGCCG CCTCCCCOTG TTGCTGGCAT CCCGAGCTTC CTCCOTTGCC AGCCAGGACG 
CTGCCBRCTT GTCTTTGCCC GCTGCTCCGC AGACGGGGCT GCAAAGCTGC AACTAATGGT 
GTTGGCCTCC CTGCCCACCT GTGGAAGCAA CTGCGCIGAT TGATGCGCCR CAGACTTTTT 
TCCCCTCGAC CTCGCCGGCG TACCCTCXKA CAGATCCAGC ATCRCCCAGT GRATGTACAT 240

TAGGGTGGTT TCCCCCTCAG CTTCGGGCTT TGTTTGGGTT TGATTGTGTT TGGCTCTTOG 300 

CTAAGCTCiAT TTATGCAGC& GAAGCCCCaC CGGCTGGRGA GAAACAAAAG CTCTTTTCTI 360 

TGTCCCGGA6 CAOSCTGCGG AGCCCTTGCA GAGCCCTCTC TCCfiGTCGCC GCCGGGCCCI 420 

TGGCCGICKA AGGAGGTGCT TCTCGCGGRG AOCGOGGGAC CCX3CCX5TGCC GftGCCOGGAG 480 

GGCCGTSGGG GCCCTGAGAT GCCGAGOSGT GCCCX3GGCXX; GCTTACCTGC ACCGCTTGCT 540 

CCGAGCCGOS GGGTCCGCCT GCTAGGCCIG CGGAAAACGT CCTAGCGACA CTCGCCCGOS 600 

GGCCCCX3AGG TCGCCCGGGA GGCCGAGOCC GCGTCOGGAA GGCaGCC»GG CGOCGGGCGC 660 

GGGGCGGGCT GTTTTGCATT ATGTGCX3GCT CGGCCCTGGC TTTTTTTACC GCTGOiTTTG 720 

TCTGCCTGCR AAACGACCGG CGAGGTCOX; CCTCGTTCCT CTGGGCAGCC TGGGTGTTTT 780 

CACTTGTTCT TGGACTGGGC CAAGGTGARG ACAATAGATG TGCATCTTCA AATGC31GCAT 840 

CCTGTGCCAG GTGCCTTGCG CTGGGTCCaG AATGTGGATG GTGTGTTCAA GAGGATTTCX 900 

TTTCAGGTGG ATCAAGAAGT GAACGTTGTG ATATTGTTTC C3^TTTAATA AGCAAAGGCT 960 

GCTCAGTTGA TTCfiATAGAA TACCCATCTG TGCATGTTAT AATACCCaCT GAAAATGAAA 1020 

TTAATACCCA GGTGACACCR GGAGAAGTGT CTATCCAGCT GCGTCCAGGA GCCGAAGCTA 1080 

ATTTTATGCT GAAAGTTCAT CCTCTGAAGA AATATCCTGT GGATCTTTAT TATCTTGTTG 1140 

ATGTCTCAGC ATCAATGCAC AATAATATAG AAAAAITAAA TTCCGTTGGA AACGATTTAT 1200 

CTAGAAAAAT GGCATTTTTC TCCCGTGACT TICGTCTTGG ATTTGGCTCA TACGTTGATA 1260 

AAACAGTTTC ACCATACATT AGCATCCACC CCGAAAGGAT TCATAATCAA TGCAGTGACT 1320 

ACAATTTAGA CTGCATGCCT CCCCATGGAT ACATCCATGT GCTGTCTTTG ACAGAGAACA 1380 

AAGGAGGTTT TGACGCCATG CTTCAGGCAG GTGTCTGTGA AAGTCATATC GGATGGCGAA 1500

AAGAGGCTAA AAGATTGCTG CTGGTGATGA CAGATCAGAC GTCTCATCTC GCTCTTGATA 1560

GCRAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG TCATCTGAAA AACAACGTCT 1620 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGTCATC TTTGCAGTTC ASGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 1860

ACCAGGTACR AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 1920

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACRATGRR AARATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGR AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGRA TARATGTCAT TTTGATGRAG ATCAGTTTTC TTCTGAGAGT TGC3\AGTCaC 2220 

ACAAGGATCA GCCTGTTTGC AfiTGGTCGftfi GAGTTTGTGT TTGTGGGAAA TGTTCATCTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 2340 

CMATCaCCA TGGAAATCTG TGTGCTGGGC ATGGAfiAeTG TGAAGCAGGC AGATGCC3VAT 2400 

GCTTCAGTGG CTGGGftAGGT GATCGATGCC AGTGCKCTTC AGCAGCAGCC CaGCACTGTG 2460 

TCAATTCAAA GGGCC3UIGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CRGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2580 

GCAAGGAAAA CIGGAMTGT ATGCAATGCC TTCaCKCTCA CAATTTGTCT CaGGCTATAC 2640

TTGATCAGTG C34AAACCTCA TGTGCTCTCA TGGRRCAACA GCATTATGTC GACCRAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CaTCATTTTC ATAGTTACAT 2760 

TCTTGATTGG GTTGCTTAAA GTCCTGRTCA TTAGRCAOGT GATACTACAA TGGAATAGTA 2820 

ATAAAATTAA GTCCTCATCA GATTACAGAG TGTCAGCCIC AftAAAAGGAT AAGTTGATTC 2880 

TGCRAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGIGA GAAGCCTGAA GAAATAAAAA 2940 

TGGRTATCaG CfiAATTAAAT GCTCATGAAA CTTTCAGGTG CSUiCTICTAA AAAAAGATTT 3000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGTCACAGG AGGAGACAAA TTGCTCAOGG TCATGCC3VGT TGCTGGTTGT AC3VCTCGAAC 3120 

GAAGACTGAC AAGTATCCTC ATCATGATGT GACTCACATA GCTGCTGACT TTTTCAGAGA 3180 

AAAATGTGTC TTACTRCTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG ATCAGCATAG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACMG 3300 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCATTGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540

AAAGATTATT GCTTITTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600

T GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

«3TTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

A RAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAASGCT 3780 



| | | | | |

T AAFVCLQNDR RGPASFLWAA HVFSLVLGLG QGEDKP.CASS N 
LGPBCGWCVQ EDFISGGSES ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT SNEINIQVTP 
GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLKSVG NDLSRKMAFF 
SRDFRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGYIHVLSL TENITEFEKR 
VHRQKISGHI DTPBGGFDAM LQAAVCESHI GMEKEAKRLL LVMTDQTSHL ALDSKLAGIV 
VPNDGNCHLK HNVYVKSTTM BHPSLGQLSE KLIDNNIIIVI FAVQGKQFHW YKDLLPLLPG 
TIAGEIESKA ANLNNLWEA YQKLISEVKV QVENQVQGIY FHITAICPDG SRKPGMEGCE 
NVTSNDEVLP KVTVTMKKCD VTGGKHYAII KPIGFNETAK IHIHBNCSCQ CEDNRGPKGK 
CVDETFLDSK CFQCDENKCH FDBDQFSSBS CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 
VYGKYCEKDD FSCPYHHGNL CAGHGBCBAG RCQCFSGWBG DRGQCPSAAA QHCVNSKGQV 
CSGRGTCVCG ECECTDPRSI GRPCBHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 
CRIMBQQHW DQTSBCFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ HNSHKIKSSS 
DYRVSASKKD KI.ILQSVCTR AVTYEREKPE EIKMDISKM AHETFROJF 

Seq ID NO: 57 DNA sequence

Nucleic Acid Accession #: NM_001719

Coding sequence: 123..1418
GC3GCGCAGCG GGGCCCGTCT GCAGCAAGTG ACCGACGGCC GGGACGGCCG CCTGCCCCCT 60

CTGCCACCTG GGGOSGTGCG GGCCCGGAGC CCGGAGCCCG GGTAGCGCGT AGAGCCGGCG 120 

CGATGCRCGT GCGCTCACTG CGAGCTGCGG CGCCGCACAG CTTCGIGGCXS CTCTOGGCAC 180 

CCCTGTTCCT GCTGCGCTCC GCCCTGGCCG ACTTCAGCCT GGACAACGAG GrGCACTCGA 240 

GCTTCATCCA CCGGCGCCTC CGCAGCCAGG AGCGGCGGGA GRTGCaGCGC GAGATCCTCT 300 

CCATTTTGGG CTTGCCCCAC CGCCCGCGCC CGCACCTCCSl GGGCAAGCAC AACTCGGCAC 360 

CCATGTTCAT GCTGGACCTG TACAACGCCA TGGCGGTGGA GGAGGGCGGC GGGCXXGGCG 420 

GCCAGGGCTT CTCCTACCCC TACAAGGCCG TCTTCAGIAC CCAGGGCCCC CCTCTGGCCA 480 

GCCTGCAAGA TAGCCATTTC CTCACCGACG CCGACATGGT CATGAGCTTC GTCflACCTCG 540 

TGGAACATGA CAAGGAATTC TTCCACCCAC GCTACCACCA TCGAGAGTTC CGGTTTGATC 600 

TTTCCAAGAT CCXSGAAGGG GAAGCTGTCa OGGCAGCCGA ATTCCGGATC TACAAGGACT 660 

ACaTCCGGGA ACX3CITCGAC AATGAGACGT TCCGGATCAG CGTTTATCAG GTGCTCC?.GG 720 

AGCACTTGGG CAGGGAATCG GATCTCTTCC TGCTCX3ACAG CCGTACCCTC TGGGCCTCGG 780

AGGAGGGCTG GCTGGTGTTT GRCATCACAG CCACCAGCAA CCACTGGGTG GTCAATCCGC 840 

GGCACAACCT GGGCCTGCAG CECTCGGTGG AGACGCTGGA TGGGCRGAGC ATCRACCCCA 900 

AGTTGGCGGG CCTGATTGGG CGGCACX3GGC CCCAGAACAA GCAGCCCTTC ATGGTGGCTT 960 

TCTTCAAGGC CaCGQAGOTC CACTTCCGC* GCaTCCGGTC CACGGGGAGC AAACAGCGCA 1020 

GCCRGAACCG CTCCAAGACG CCCARGAAGC AGGAAGCCCT GCGGATGGCC AACBTGGCAG 1080 

AGAACAGC&G CAGCGACCAG AGGCAGGCCI GTAAGAAGCA OSAGCTGTAT GTCAGCTTCX; 1140 

GAGACCIGGG CTGGC3W36AC TGGATCATCG CGCCTGAAGG CTACGCCGCC TACTACTGTG 1200 

AGGGGGAGTG TGCCTTCCCT CTGAACTCCI ACaTGAACGC CACC3\ACCaC GCCATCGTGC 1260 

AGACGCTGGT CCACTTCATC ARCCCGGAAR CGGTGCCCAA GCCXJTGCTGT GOKCCaCXK 1320 

C CATCTCCGTC CTCTACTTOG ATGACAGCTC CAACGTCATC CTGAAGfiAAT 1380 

G GCCTGTGGCr GCCACTAGCT CCTCOSAGAA TTCAGACCCT 1440 

TTGGGGCCAA GTTTTTCTGG ATCCTCCATT GCTCGCCTTG GCCAGGAACC AGCAGACCAA 1500 

CTGCCTTTTG TGAGACCTTC CCCTCCCTAT dtCCAACTTT AAAGGTGTGA GACfTATTAGG 1560 

: AGCATATGGC TTTTGATCAG TTTTTCRGTG GCAGCATCCA ATGAACRAGA 1620 

\ AAACCTAGCA GGAAAABAAA ACAACGCATA AAGAARAATG 1680 

GCCGGGCCAG GTCATTGGCT GGGAAGTCTC AGCCATGCaC GGACTCGTIT CCAGAGGTAA 1740 

CTACCAGCCA GGCCACCCAG CCGTGGGSGG AfiGGGGGOST GGCAAGGGGT 1800 

C CTCTCTCTGC GAAAGGAAAA TTGACCOQGA AGTTCCTGTA ATAAATGTCA 1860 
3 AATGAATG 

1 11 21 31 41 51

P LFLLRSALAD FSLDNEVHSS FIHRRLRSQE RREMQREILS 60 
ILGLPHRPRP HLQGKHNSAP MFMLDLYNAM AVEBGGGPGG QGFSYPY 
LQDSHFLTDA DMVMSFVNLV BHDKEFFHPR YHHREFRFDL S 
IRERFDNETF RISVYQVLQE HLGRBSDLFL LDSRTLWASE EGWLVFDITA T 
HNLGLQLSVE TLDGQSINPK LAGLIGRHGP QNKQPFMVAF FKATEVHFRS IRSTGSKQRS 
QNRSKTPKNQ EALRHANVAE NSSSDQRQAC KKHELWSPR DLGWQDWIIA PEGYAAYYCE 
GECAFPLNSY MNATNHAIVQ TLVHFINPET VPKPCCAPTQ LHAISVLYFD DSSNVILKKY 
RNMWRACGC H 

Seq ID NO: 59 DNA sequence
Nucleic Acid Accession #: NM_002821
Coding sequence: 150..3362

1 11 21 31 41 51

| | | | | |

AACTCCCGCC TCGGGACGCC TCGGGGTCGG GCTCCX3GCTG CGGCTGCTGC I— 
GCGCTCCGGT GCGTCCGCCT CCTGTGCCCG CCXSCGGAGCa GTCTGCGGCC C 
CCTCaGCTCC TTTTCCTGAG CCCGCCGCXa TGGGAGCTGC GCX3GGGATCC CCS3GCCAGAC 
CCCGCCGGTT GCCTCTGCTC AGaSTCCIGC TGCTGCKGCT GCTGGGCJGGT ACCCAGACAG 
CCATTGTCTT CSTCAfiGCAG CCGTCCTCCC AGGATGC&CT GCaGGGGCGC CX3GGCX3CTGC 
TTCGCTGTGA GGTTGRGGCT CCGGGCCOGG TACATGTGTA CTGGCTGCTC. GATGGGGCCC 
CTGTCCAGGA CAOGGAGCGG OGTTTOKCC AGGGCAGCAG CCTGAGCTTT GCAGCTGTGG 
ACCGGCTGCA GGACTCTGGC ACCTTCCAGT GTGTGGCTCX3 GGATGATGTC ACTOSAGAAG 
3 TGCCAACGCC TCCTTC3\ACA TCAAATGGAT TGAGGOiGGT CCTGIGGTCC 
C AGCCTCGGAA GCTGAGATCC AGCCACAGAC CCAGGTCACA CTTCS3TTGCX: 
ACATTGATGG GCACCCTCGG CCCACCTACC AATGGTTCCX3 AGATGGGACC CCCCTTTCTG 
3 CAACCACACA GTCAGCAGCA AGGAGOGGAA C 



CCCAGGACGT GGTAGTAGCG AGGTATGAGG AGGCCATGTT CCATTGCCAG TTCTCAGCCK 
AGCCACCCCC GAGCCTGCRG TGGCTCTTTG AGGRTGAGAC TCCCATCACT AACOGCAQIC 
GCCCCCCACA CCTCCGCAGA GCCACAGTGT TTGCCAACGG GTCTCTGCTG CTGACCCAGG 
TCCGGCCACG CAATGCAGGG ATCTACCGCT GCATTGGCCA GGGGCAGAGG GGCCCACCCA 
TCATCCTGGA AGCCACACTT CACCTAGCAG AGATTGAAGA CATGCCGCTA TTTGAGCCAC 
GGGTGTTTAC AGCTGGCAGC GAGGAGCGTG TGACCTGCCT TCCCCCCAAG GGTCTGCCAG 
r GTGGTGGGAG CACGCGGGAG TCCGGCTGCC CACCCATGGC AGGSTCTACC 



GCCACGCGGC CAACCTGGCT GGTCAGCGGA GACAGGATGT CAACATCACI GTGGCCACTG 1380 
TGCCKTCCTG GCTGAAGAAG CCCCAAGACA GCCAGCTGGA G3AGGGCAAA CCCGGCTACT 1440 
TGGATTGCCT GACCCAGGCC ACACCAAAAC CTACAGTTGT CIGGTACAGA AACCAGATGC 1500
TCATCTCAGA GGACTCACGG TTCGAGGTCT TCAAGAATGG GACCTTGCGC ATCAACAGCG 1560 
TGGAGGTGTA TGATGGGACA TGGTACCGTT GTATGAGCA.G CA.CCCCAGCC GGCAGCATCG 1620 
AGGCGCAAGC CCGTGTCCAA GIGCTGGAAA AGCTCAAGTT CACACCP " ' ' ' 

AGCAGTGCAT GGAGITTGAC AAGGAGGCCA CGGTGCCCTG TTCAGCCACA G 
AGCCCACTAT TAAGTGGGAA CGGGCAGATG GGAGCAGCCT CCCAGAGTGQ GTGACAGACA 
ACGCTGGGAC CCTGCATTTT GCCCGGGTGA CTCGAGATGA CGCTGGCAAC TACACTTGCA 
TTGCCTCCAA CGGGCCGCAG GGCCSiGATTC GTGCCCATGT CCAGCTCACT GTGGCAGTTT 
TTATCACCTT CAAAGTGGAA CCR.GMXGIA CGACTGTGTA CCAGGGCCAC ACAGCCCTAC 
TGCAGTGCGA GGCCCAGGGG GACCCCAAGC CGCTGATTCA GTGGAAAGGC AAGGACCGCA 

TCCTGGACCC CaCCAfiGCTG GGACCCAGGA TGCacaTCTT CC3«3AATGCIC TCCCTGGTGR 2100

TCCATGACGT GGCCCCTGAiS GACTC&GGCC GCTACACCTG CAITGCAGGC AACRGCTGCA 2160 

ACATCAAGCA CftCGGAGGCC CCCCTCTRTG TCGTGGACAA GCCTGTGCCM GAGGAGTCGG 2220 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TGATCCAGAC CATTGGGTTG TCGGIGGGTG 2280 

CCGCIGTGGC CTACATCATT GCCGTGCTGG GCCTCaTGTT CTACTGC3UVG AAGCGCTGCa 2340 

AAGCCAAGCG 6CTGCAGAAG CAGCCXX3AGG GCX3AGGAGCC AGRGATGGAA TGCCICAAOS 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCa AGAflGAAGTG GCCTTGACCA 2460 

GCTTC3GGCTC CGGCCCCGCG GCCACCAACA AACGOCACAS CACAAGTGAT AAGAIGCACT 2520

TCCCACGGTC TAGCCTGCAG CCTATCACCA CGCTGGGGAA GAGTGAGTTT GGGGBGGTGT 2580

TCCTGGCaAA GGCTCRGGGC TTGGAGGAGG GSGTGGCftGA GACXICTGGTA CTTGIGAAGA 2640 

GCCTGCAGAC GAAGGATGAG CAGCAGCSiGC TGOACTTCaS GAGGGAGTTG GAGAIGTTTG 2700 

GGAAGCTGAA CCRCGCCAAC GTGGTGCX3GC ICCTQGGGCI GTGCCGGGAG GCTGfiGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGQA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CRAGCAGRAG GTGOICCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA COGCTTTGTG CATAAGQACT 2940 

TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTOGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC ATCCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTQGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG G3CCCTCAGC CCCAAGGACC 3300

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360

GAGQAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCRAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3540 

GGCTGACTTG GACCCAAACT GGGCGACTAG GiSCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600

CTCTTCCTCT ATCAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCrGGGT TTGTGGGGAG TTCCTTAATA TTCTCRAGTT CTGGGCACAC 3780

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CTAGACCAGG ATTATAGAGG 3840 

ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCAOGTCTT 3900 

CCCCACCCTT CTCTCCTTTC CTCATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGOGGCTTT TATATGTAAT 4020 

TGCAGOGTGG GGTGGGTGGG CATGGGAGGI AGQGGTGGGC CCTGGRGATG AGGAGGGTGG 4080 

GC»TCX:TTA CCCCACACIT TTATTGTTGT CQTTTTTTGT TTGTTTTGTT TTITTGTTTT 4140 
TGTTTTTCTT TTTACACTCG CTGCTCTCAR TAAATAAGCC TTTTTTA 



1 11 21 31 41 51 
| | | | | |
MGAARGSPAR PRRtPLLSVL IiLPLLGGTOI AIVFIKQPSS QDALQGRRAL LRCEVBAPGP 
VHVYWLLDGA PVQDTERHFA QGSSLSFAAV DRLQDSGTFQ CVAHDDVTGE EARSANASFM 
IKWIEAGPW LKHPASEABI QPQTQVTLRC HIDGHPRPTY QWFRDGTPLS DGQSNHTVSS 
KERNLTLEPA GPEHSGLYSC CAHSAFGQAC SSQNFTLSIA DBSFARWLA PQDWVRRYB 
EAMFHCQFSA QPPPSLQWLF EDETPITNRS RPPHLRRATV FANGSLLLTQ VRPBNAGIYR 
CIGQGQRGPP irCSaVTLHLA EIBDMPLFBP RVFTAGSBBR VTCLPPKGLP EPSVWWEHAfi 
VRLPTHGRVY QKGHELVLRN lAESDAGVYT CHRRNLAGQR RQDVNITVAT VPSWLKKPQD 
SQLEEGKPGY LDCLTQATPK PTWHYENQM LISEDSRFEV FKNGILRINS VEVYDGTWYR 
CMSSTPAGSI EAQARVQVLE KLKPTPPPQP QQCMEPDKEA TVPCSATGRE KPTIKWERAD 
GSSLPBWVTD NAGTLHFAEV TRDDAGNYTC lASNGPQGQI EAHVQLTVAV FITFKVBPBR 
TTVYQGHTAL LQCEAQGDPK PLIQWKGKDR ILDPTKLGPR MHIFQNGSLV IHDVAPEDSG 
RYTCIAGNSC NIKHTBAPLY WDKPVPBES EGPGSPPPYK MIQTIGLSVG AAVAYIIAVL 
GLMFYCKKRC KAKELQKQPB GEEPEMECLN GGPLQN8QPS AEIQEEVALT SLGSGPAATN 
KRHSTSDKMH PPRSSLQPIT TLGKSEFGEV FLRKAQGLEE GVAETLVLVK SLQTKDEQQQ 
LDFRRBLEMF GKLNHANWR LLGLCREAEP HYMVLEYVDL GDLKQFLRIS KSKDEKLKSQ 
PLSTKQKVAL CTQVALGMEH LSNNRFVHKD liAARNCLVSA QRQVKVSALG LSKDVYNSEY 
YHPRQAWVPL RWMSPEAILE GDFSTKSDVW AFGVLMWEVP THGEMPHGGQ ADDEVLADIiQ 1020 
AGKARLPQPE GCPSKLYRLM QRCWALSPKD RPSFSEIASA L 



| | | | | |

CACCTGCACC CCGCCCGGGC ATAGCACCAT GCCTGCTTGT CGCCTAGGCC CGCTAGCCGC 
CGCCCTCCTC CTCAGCCTGC TGCTGTTCGG CTTCACCCTA GTCTCAGGCA CAGGAGCAGA 
GAAGACTGGC GTGTGCCCCG AGCTCCAGGC TGACCAGAAC TGCACGCAAG AGTGCGTCTC 
GGACAGCGAA TGCGCCGACA ACCTCAAGTG CTGCaGCGOG GGCTGTGCCA CCTTCTGCCT 
TCTCTGCCCA AATGATAAGG AGGGTTCCTG CCCCCAGGTG AACATTAACT TTCCCCAGCT 
CGGCCTCTGT CGGGACCAGT GCCAGGIGGA CAGCCAGTGT CCTGGCCAGA TGAAATGCTG 
CCGC3UiTGGC TGTGGQAAGG TGTCCTGTGT CACTCCCAAT TTCTGAGGTC CAGCXACCAC 
CAGGCTGAGC AGTGAGGAGA GAAAGTTTCT GCCTGGCCCT GCATCTGGTT CCAGCCCACC 
TGCCCTCCCC TTTTTCGGGA CTCTGTATTC CCTCTTGGGC TGACCACAGC TTCTCCCTTT 
CCCAAOCSWVT AAAGTAACCA CTTTCAGCaA AAAAAAAAAA AAAA 



MPACRLGPIA AALLLSLLLF GFTLVSGTGA BKTGVCPBLQ ADQHCTQECV SDSBCSDtlLK 
CCSAGCATFC ILCPNDKEGS CPQVNINFPQ LGLCRDQOJV DSQCPGQMKC CRNGCGKVSC 
VTPNF 
CGCGGGGCGC GGAGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCaSAGA CCXSCGGCGCT 60

GAGGACGOSG GAGCCGGGAG CGCaCGOSCG GGGTGC3AGTT CAGCCTftCTC TTTCTTAfiRT 120 

GTGARAGGAA AGGAAGATCa TTTCATGCCT TGTTGATAAA GGITCAGSCT TCTGCTGATT 180 

CATAACCRTT TGGCTCTGAG CTATGAOVRG ftGAOSAAACa AAAAGTTAAA CTTACAAGCC 240 

TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATCSCTTT TGCGAAGTGC AGOAAAATTA 300 

AATGTGGGCA CCAAGRRAGA GGATGGTGAG fiGTACAGCTC CCACCCCCW5 TCCAAAGGTC 360 

TTGOGTTGTA AATGCCACCA CCATTGTCCa GAAGACTCRG TCSUICAATAT TTGCAGCACR 420 

GACGGATATT GTTTCACGAT GATAGAAGAG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 

GGTTGCX:TAG GACTAGAAGG CrCaGATTTT CAGTGTCGGG ACSCTCCCAT TCCTCATCSiA 540 

AGAAGATCAA TTGAATGCTG CAC3M3AAAGG AACX3AATCTA ATAAAGACCT ACACCCTACA 600 

CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TACACCaCAG GGCTTTACTT 660 

ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 

TATAAAAGAC AAGAAACCAG ACCTCX3ATAC AGCATTGGGT TAGAAa«3GA TGAAACTTAC 780 

ATTCCTCCTG GAGAATCCCT GAGAGACTTA ATTGAGCAGT CTCAGAGCTC AGGAAGTGGA 840 

TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCaGAT GQTGAAACaG 900

ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGOGA AAAGGTAGCT 960 

GTGRAAGTGT TCTTCRCCAC AGAGGAAGCC AGCTGGTTCA GAGAGACRGA AATATATCRG 1020 

ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 

GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1140 

T CCACCACCCT AGACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 



G GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
3 GCACCAAACG CTATATGCCT CCAGAAGTGT TGGACGAGAG CTTGAACAGA 1440 



ATGACAGAAT GCTGGGCTCA CAATCCIGCA TCAAGGCTGA C 

ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800 

CATCTCTGCA QAAAQCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 

TARGCATCCA CAGTACAAGC CTTGAACATC GTCKTGCTTC CEAGTGGGTT CAGACCTCAC 1920 

CTTTCAGGGA GCGACETGGG CAAAGACAGA GAAGCTCCXA GAAGGAGAGA TTGATCCGIG 1980 
TCTGTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT 
DSGLPWTSG CLGLBGSDPQ CRDTPIPHQR RSIECCTBHN EC
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQBIRPRYS IGLBQDBTYI PPGBSIiRDLI 
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 
WFRETEIYQT VLHRHENILG FIAADIKGTG SWTQLYLITD YHGHGSLYDY LKSTTLDAKS 
MLKLAYSSVS GLCHUTTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGIAVKFISD 
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILHBV ARRCVSGGIV 
EEYQLPYHDL VPSDPSYEDH SEIVCIKKLR PSFPNRWSSD ECLRQMGKLH TECWAHHPAS 
RLTALRVKKT LAKMSESQDI Klj 



1 CAACGGAGRG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 
TATAAACGCC AACAATCATT AIGACAAGAT CTTGGCTCAT 
AGTCGTATCA GGGQICGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 
AAAAGTCCAT CTGTGGACAG 



TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC T 

A CTGAGGGAGG AGATCGAGGG AAAGGGATCC 
r TTGCACCGAG TAATGAGGCT TGGGACS 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT T 

h TTCCTTCAAT GTATAACAAT 

I GTTGTCCATG TCATTGACCG TGIGCTTACA 
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 
GCAGCTGCCA TCRCATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTICACACTC 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 
A TCTTAAATAC TCTCCAGTGT 

A TTATGGGAGG AGCAGTCTTT GAGACGC 
GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA T' 

T GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140 
G AGCIGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGI GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACAOTT TGCTGGCACX: TGIGAATAAT 1260 
GCATTTTCTG ATGRTACTCT CRGCSTGGTT CAGOKCTCC TTAAATTAAT TCIGCAGAAT 1320 
CRCATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA AOGGGCAART ACTGGAAACC 1380 
ATCGGAGGCA AACaGCTCAG AGTCTTCGTA TATCGTACAG OTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATAIT CCGCGAGATC 1500 
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGQG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 
ACATTATTTG TGCCAACCAR TGATGCTTTT AflGGGAATGR CTAGTGAAGA MAAGAAATT 1680

CTGRTACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACC3M3GAGTT 1740 

TTC3VTTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 

AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCKAAAGAA 1860 

TCTGACATCA TGACAAC3U«k TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 

GACRCACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT (31AATACATC 1980 

CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040 

AAAATTATAA CCAAAGTTCT GGARCCSflAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 

ATTATCAflflA CTGAAGGACX: CACACTARCA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 

AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 

AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 

GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 

ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 

GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG A 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTAl 
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG A 
ACSyy^GAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT T 
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA G 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG A 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG A 

CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 

TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCIT GGTGTACAAA ACTTTTTATA 3000 

TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 

CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCSUlTGCai . TATTTTTTAA 3120 

TCTCAAACGT TTCAATAAAA CCATTTTTCR GAtATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 66 Protein sequence
Protein Accession #: NP_006466.1

1 11 21 31 41 51 
MIPFLPMPSL LLLLIVHPIN ANNHYDKILA HSRIEGRDQG PNVCALQQIL GTKKKTFSTC 60

KNWYKKSICG QKITVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 

KIiREEIEGKG SFTYFAPSNE AHDNIiDSDIR RGLESNVNVE LIJIALHSHMI NKRMLTKDLK 180 

HGMIIPSMTO NliGLFINHYP NGWTVNCAR IIHGHQIATN GWHVIDRVL TQIGTSIQDF 240 

lEAEDULSSP RAAAITSDIL EALGRDGHPT LFAPTNEAFE KIjPRGVLERP MGDKVASEAL 300 

MKYHILNTLQ CSESIMGGAV FETLBGHTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN MAFSDDTLSM 420 

VQRLLKLILQ MHILKVKVGL NELYNGQILE TIGGKQLRVP VYHTAVCIEH SCMEKGSKQG 480 

SNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TPLSLLEAAD LKBIiTQPGD WTLPVPTMDA 540 

FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKBVND 600 

TLLVHELKSK ESDIMTTNGV IHWDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660 

FKEIPVTVYT TKIITKWEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE PHLIKEGETI 720 

TEVIHGEPII KKYIKIIDGV PVEITEKETR EBRIITGPEI KYTRISTGGG BIEETLKKIiL 780 
QBEVTKVTKP lEGGDGHLFE DEEIKRLLQG DTPVRKIflAN KKVQGSHHEL HEGRSQ 



ATGCCTGGGG GGTGCTCCCG GGGCCCCCCC GCCGGGGACG GGOSTCTGCG GCTGGCGCGA 
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 
AATCIGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCfiACCTCTT CCTTACCGGC 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA CK3TCCTTCAC 
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCICAAGGA AACAGAGGIA 
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCXKAGATT AACAAACCTC 
AGTTCTAACT CGGATGTCCT CGAGTGA 



Protein Accession #: EOS sequence 

1 11 21 31 41 51 
MPGGCSRGPA AGDGRLRLAR LAI.VLI.GWVS SSSPTSSASS PSSSAPFLAS AVSAQPPLPD
QCPALCECSE AaETVKCVNR NLTEVPTDLP AYVRNLFLTG NQIASNHFLY LPRDVLAQLP 
SLBHLDLSSM SLVSLTYVSF RNLTHLESLH LEDNALKVm NGTLAELQGL PHIRVFUHSIN 
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMKHRVL LELNSflDLDC DPILPPSLQT 
SYVFLGIVLA LIGAIFLLVL YIHRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 
SSNSDVLE 
CAGCACCCAG CTCCCOGCCR CCGCCRTGGT CCCCGRCACC GCCTGCGTTC TTCTGCTCAC 60

CCTGGCTGCC CICX3<3CX3CGT CCGGACRGGG CC&Ga«3CCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGRTGCTT CGGGAACTGC AGGAAACCAA OGCGGCGCTG CAGGACGTGC GGGRCTGGCT 180 

GOGGCAGCAG GTCAGGGAGR TCaCGTTCCT GAAAAACACG GTGATGGAGT GTGRCGCGTG 240 

CGGGATGCAG CMTC3M3TAC GCRCCGGCCT ACCCAGCGIG CGGCCCCTGC TCCRCTGOX: 300 

GCCCGGCTTC TGCTTCCCTG GCGTGGCCTG C3^TCC»GACX3 GAGAGCGGCG GCCXSCTGOGG 360 

CCCCTGCCCC GCGGGCTTCa CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGftGTGCAA 420 

OGCKCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCOGCTGOSA 480 

GGCTTGCCCG CCX3GGGTAC» GCGGCCXCAC CCACCAGGGC GTGGGGCTGG CTTTCXSCCflA 540 

GGCJCAACAAG CAGGTTTGCR CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 

CCCCAACTCC GTGTGCATC» ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660

CTTCX3TGGGC GACCAGGCGT CCGGCTGCCR GOSCGGOSCA CRGOGCTTCT GCCCXX»CGG 720 

CTOGCCCRGC GAGTGCCACG AGCATGCAGA CTGOGTCCTA GRGCGCX»TG GCTCX3CGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCX3CG ACaCTGACCT 840 

AGftCGGCTTC CCGGROGAGA AGCTGCGCTG CCCGGAGCCG CRGTGCCGTA AGGflCa^CTG 900 

CX3TGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

OGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGOSTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGOJTGCGA 1140 

CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACOCAA 1200 

CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCRCGACTTT GTGGGAG 
CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC C 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCT 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA A 

GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG G 
GGGTTACACT GCCITCAATG GCGTGGACTT CGAGGGCACG TTCCATC"*" ' 
GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT T 
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG C 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980

CGCTCTGTGG CATACAGGAG ACAC3M3AGTC CCAGGTGCGG CTGCTGTGGA AGGACOOSCG 2040 

AAflCGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCRAGT 2100 

GGGCTACATC AGGGTGOSAT TCTATGAGGG COCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGOGGG GIGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CRTCTGGGCC AACCTGCGIT ACCGCTGCM TGACACCATC CKAGfiGGACT ATGAGACCCA 2280 

TCAGCTGOGG CAAGCXTAGG GACXa«3GGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CBTCCTGAGG GGGAAGIGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAOTGTG TGTGCAGGG 




| | | | | |

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRBLQE TNAALQDVHD WLRQQVHEIT 
FLKHTVMECD AOGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 
HGSHCTDVWE C3JAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKAHKQVCTD 
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGPVGDQASG CQRGAQRFCP DGSPSECHEH
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPKSAQEDSD HDGQGDACDD DDDNDGVPDS 
RDNCRLVPNP GQEDADREGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWLD
PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTRFNGV DF3GTFHVNT VTDDDYAGFI 
FGYQDSSSFY WMWKQMBQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 
ESQVRLLWKE PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY E 
RLGVFCPSQE NllWANLRYR CNDTIPEDYE THQLRQA 



Seq ID NO: 71 DNA sequence
Nucleic Acid Accession # 



| | | | | |

GAGTCACCAA GGAAGGCAGC GGCAGCTCCA CTCAGCCAGT ACCCAGATAC G 

TTCCCCAGCC ATGGCXTCCC TGGGGCAGAT CCTCTTCTGG AGCATAATTA GCATCATCAT 120 

TATTCTGGCT GGAGCAATTG CACTCATCAT TGGCTTTGGT ATTTCftGGGA GACftCTCCAT 180 

CACAGTCACT ACTGTOGCCT CAGCTGGGAA CATTGGGGAG GATGGAATCC TGAGCTGCAC 240

TTTTGAACCT GRCATCAAAC TTTCTGATAT CGTGATACAA TGGCTGAAGG AAGGTGTTIT 300

AQGCTTGGTC CRTGASTTCA AAGAAGGCAA AGATGAGCTG TOGGAGCAGG ATGRARTGTT 360

CAGAGGCCGG ACAGCSiGTGT TTGCTGATCA AGTGATAGTT GGCAATGCCT CTTTGCGGCT 420 

GAAAAACGTG CAACTCACAG ATGCTGGCAC CTACAAATGT TATATCATCA CTTCTAAAGG 480 

CAAGGGGAAT GCTAACCTTG AGTATAAAAC TGGAGCCTTC AGCATGCCGG AAGTGAATGT 540 

GGACTATAAT GCCAGCICAG AGRCCTTGCG GTOTGAGGCT CCCXX3ATGGT TCCCXKAGCC 600

CACAGTGGTC TGGGCATCCC AAGTTGACCA GGGAGCCAAC TTCTCGGAAG TCTCCAATAC 660 

CAGCTTTGAG CTGAACTCTG AGAATGTGAC CATGAAGGTT GTCTCTGTGC TCTACAATGT 720 

TAOSATCAAC AACACATACT CCIGTATQAT TGAAAATGAC ATTGCCAAAG CAACAGGGGA 780 

TATCAAAGTG ACAGAATCGG AGATCAAAAG GCGGAGTCRC CIACAGCTGC TAAACTCBAA 840 

GGCTTCTCTG TGTGTCTCTT CTTTCTTTGC CATCAGCTGG GCACTTCTGC CTCTCAGCXX: 900

TTACCTGATG CTAAAATAAT GTGCCTCGGC CACRAAAASG CATGCSAAGT CATTGTTACA 960 

ACAGGGATCT ACAGAAOTAT TTCACCACCA GATATGACCT AGTTTTATAT TTCTGGGAGG 1020 

AAATGAATTC ATATCTAGAA GTCTGGAGTG AGCAAAOUU; AGCAAGAAAC AAAAAGAAGC 1080 

CAAAAGCAGA AGGCTCCAAT ATGAACAAGA TAAATCTATC TTCAAAGACA TATTAGAAGT 1140 

TGGGAAAATA ATTCATGTGA ACTAGAGTCA ACTGTGTCAG GGCTAAGAAA CCCTGGTTTT 1200



GftGTAGAAAA GGGCCKSGftA AGRGGGGRGC CAACAAATCT GTCTGCTTCC TCACATTACT 1260

CATTGGCAAA TAAGCSiTTCT GTCTCTTTGG CTGCTGCCrC AGCACAGAGA GCCRGiftACTC 1320 

TATCGGGCAC CAGGATAACA TCTCTCAGTG AACAGAGTTG ACAAGGCCTA TGGGAAATGC 1380 

CTGATGGGAT TATCTTCAGC TTGTTGAGCT TCTAAGTTTC TTTCCCTTCA TTCTACCCIG 1440 

CAAGCCAAGT TCTGTAAGAG AAATGCXTGA GTTCIAGCIC AGGTTTTCTT ACTCTGAATT 1500 

TAGATCTCXA GACCCTGCCT GGCCACAATT CAAATTAAGG CAACAAACAT ATACCTTCCA 1560 

TGAAGCACAC ACAGACTTTT GAAAGCAAGG ACAATGACTG CTTGAATTGA GGCCTTGAGG 1620 

AATGAAGCTT TGAAGGAARA GAATACTTTG TTTCCAGCCr CX^TCCCACA CTCITCATGT 1680 

GTTAACCaCT GCCTTCCTGG ACCTTGGRGC CAOGGTGACT GTATTACATG TTOTTATAGA 1740 

T TAGAGTTCTG ATCGTTCAAG AGAATGATTA AATATACATT TCCTAAAAAA 1800 



MASLGQILFW SIISIIIILA GAIALIIGFG ISGRHSITVT TVASAGNIGE DGILSCTFEP 
DIKLSDIVIQ WLKEGVLGLV HEFKEGKDEL SEQDEMFRGR TRVFADQVIV GNASLRLKNV 
QLTDAGTYKC YIITSKGKGN ANLEYKTGAF SMPEVNVDVN ASSETLRCEA PRWFPQPTW 
MASQVDQGAN FSEVSNTSFE LNSENVTMKV VSVLYNVTIN NTYSCMIEND lAKATGDIKV 
TESEIKRRSH LQLLNSKASL C 



GGGAGGGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CICCTCCGCC TCCAGCTCCG 120 

CGCTGCKCGG CAGCOGGGAG CXXTGCGACC CCAGGGCCCC GCXXKCTCCC CGCAGCGGCT 180 

CCXKSSGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCOXX; CCGTOGAGOS CCTCTGAGAT 240 

CCCXAAGGGG AAGCAAAAGG OKAGCrCCG GCBGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCKCTG GGGCX»ATGG 360 

CATTCCGGGT ACACCTGGGA TCECAGGTOS GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCX»ACTAC AAGCAGTGTT CATGGAGITC 480 

ATTGAATTAT GGCATAGATC TTGGGAAAAI TGOGGAGTGT ACATTTACAA AGATGCCTTC 540 

AAATAGIGCT CTAAGAGTTT TGTTCAGTGG CTO^CTTCGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGOGT TGGTATTTO^ CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCSACAATTA ATATTCATCG 720 

CACTTCTTCT QTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 

TTCTOGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 

TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACIGTT 1020 

[TATTCA TTTTGCTTCA ATOUIAAGTG GTTTCAATAT TTTTTTTAGT 1080

r ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140 

GGTCTTTTGT ttTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCMTCTT 1200 

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 

Seq ID NO: 74 Protein sequence 



MRPQGPAASP QRLRGLLLLL LLQLEAPSSA SEIPKGKQKA CLRQREWDL YNGMCLQGPA

GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEB SWTPNYKQCS WSSLNYGIDL 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGABCSGPL PIEAIIYLDQ 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW N 



| | | | | |

CACAGCGTGG GRAGCAGCTC TGGGGGAGCT CGGAGCTCCC GATCACGGCT TCTTGGGGGI 60 

AGCTACGGCT GGGTGTGTAG AACGGGGCCG GGGCTGGGGC TGGGTCCCCT AGTGGAGACC 120 

CAAGSTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCTTCT GGGTCAGTTC CTTATTCAAG 180

TCTGCAGCaS GCTCCCAGGG AGATCTCGGT QGAACTTCAG AAACGCTGG6 CAGTCTGCCI 240 

TTCAACCATG CCCCTGTCCC TGGGAGOXSA GATGTGGGGG CCTGAGGCX:! GGCTGCTGCT 300 

GCTGCTACTG CIGGCATCAT TTACAGOCCG GTGCCCXXSOG GGTGAGCTGG AGACCTCAGA 360 

CGTGGTARCT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420 

CGGCGAGCKA GTGGG6CAAG TGGCATGGGC TGQGGTGGAC GCGGGCGAAG GCGCCCAGGA 480 

ACTAGOGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC COGGCTTAO; AGGGCCGOST 540 

GGAGCAGCCG CCGCCCCCAC GCAACCCCCT GGACGGCTCA GTGCTCCIGC GCAACGCAGT 600 

GCAGGCGGAT GAGGGCGAGT ACGAGTGCCX3 GQTCAGCACC TTCCCCGCCG GCAGCTTCCA 660 

GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CTGCTCCAGC 720 

ACTAGAAGAG GGCCAGGGCC TGACCCTGGC AGCCTCCTGC ACRGCTGAGG GCAGCCCAGC 780 

CCCCAGCGTG ACCTGGGACa OKAGGTCAA AGGCACAACG TCXaiGCXXiTT CCTTCAAGCA 840 

CTCCCGCTCT GCTGCCGTCft CCTCAGftGTT CCACTTGGTG CCTAGOXiCA GCATGAATGG 900 

GCRGCCACTG ACTTGTGTGG TGTCCCaTCC TCGCXTTGCIC CAGGAOCSAA GGAICACCCA 960 

CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020 

GTGGCACSiTT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080 

CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140 

CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTAOGTCT GCCATGTCAG 1200 
CaATGAGTTC TCCICAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGRCC CCCAOSAflGR 1260

CICTGGGAAG CAGGTGGACC TAGTGTCftGC CTCGSTGGTG GTGGTGGGTG TGATCX3CCGC 1320 

ACTCTTGTTC TGCCTTCTGG TGGTGGTGCT GGTGdC&TG TCCCGATACC ATCGGCGCAA 1380 

GGCCCAGCAG ATGACCX31GA AATATGAGGA GGAGCIGACC CTGACCAGGG AGAACTCCRT 1440 

CCGGAGGCTG CATTCCCATC ACACGGACTC CSiGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTClCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CMAGGAGGAG GAAGATCAGG ATGARGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACX3GG 1740 

CAATGGCATC TACATCAATG GGCGGGGACA CCTGGTCTGA C(XAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCR TATCAGAGTC 2100 

AAGTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220

CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340

TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCRGCCTGG ACGATATAGC CAGACCCTGT 2640 
A ACCAAAACCC A 



Protein Accession #: AAH10423 

| | | | | |

MPLSLGAEMW GPEAWLLLLL LLASFTGECP AGELETSEW TWLGQDAKL P " 
QVGQVAWARV DAGEGAQELA LLHSKYGIiKV SPAYEGRVEQ PPPPRNPLDG SVLLENAVQA 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSUOPGPALE EGQGLTIiAAS CTAEGSPAPS 
VTWDTEVKGT TSSRSFKHSR SAAVTSBFHL VPSRSMHGQP LTCWSHPGL LQDQRITHIL 
HVSFIiAEASV RGLEDQNLWH IGREGAMIiKC LSEGQFPPSY NWTRLDGFLP SGVRVDGDTL 
GPPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAAIiL 
FOiLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDFRS QPEESVGLSA 
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 
AMHHFVQEHG TLRAKPTGNG lYINGRGHLV 

Seq ID NO: 77 DNA sequence 

Nucleic Acid Accession #: NM_003474.2

Coding sequence: 307..3036

1 11 21 31 41 51 

| | | | | |

CACTAACGCT CTTCCTAGTC CCCGGGCCAR CTCGGACAGT TTGCTCATTT ATTGCAAOGG 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGOGACGCA OGCACACACA CGGGGGGAAA 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGOGGC GCGGGCCGTG CGCXSAGGGCT 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCOGGTCGC GACGCCCGGC CCCGCTOGGC 
GCCCGCGTGG GATGGTGCAG CGCTCGCCGC CGGGCCOGAG AGCTGCTGCA CTGAAGGCCG 
GCOACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGOCCTCCT GCTCGCCCTG 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 
GCTGATGAAG TTGTCAGTGC CICTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGT.^^.T TCTGGGTCAC 
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGA^?AGCGT CCGGGGATCA 
TGTGGRTCAC ATCACAACAC ACCARACCIC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 
GTGATCGTGG CAGACAACCG AGAGTTTGAG AGGCAAGGA^ AAGATCTGGA 7VAAAGTTAAG 
R TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACAITCGG 



GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 

ARTCRTGACa CACTGGRCAG GGGCTGTAGC TGTCASATGG CGGTTGAQAA AGGAGGCTGC 1440 

ATCMGAACG CTTCCACCGG GTACCCATTT CXCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 

GACTTGGAGR CCRGCCTGGA GAAAGGARTG GGGGTGTGCC TGTTTAACCT GCCGGAftCJTC 1560 

AGGGAGTCTT TCGGGGGCCR GAAGTGTGGG AACAGRTTTG TGGftAGARGG AGAGGfiSTGT 1620 

GACTGTC3GGG AGCCAGAGGA ATGTATGAAT OSCTGCTGCA ATGCCACCAC CTGTACCX3t3 1680 

AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGK3AAG ACTGCCAGCT GAAGCCTGCR 1740 

GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGOC 1800 

AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGMGGGC ACTCATGTCA GGATGTGGAC 1860 

GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGR 1920 

CCAGGTGCIA AACCTGCCCC TGGGATCTGC TTTGRGAGRG TC3UITTCTGC AGGTGATCCT 1980 

TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGftGAT GAGAGATGCT 2040 

AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCfiGCCGGC CAGTCATTGG TACCAATGCC 2100 

GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCIGTG CCGGGGGACC 2160 

CACGTGTACT TGGGCGATGA CATGCOGGAC CCAGGGCTTG TGCTTGCAGG CAC3UUU3aX3T 2220 

GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 

GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGRfifiAA CTGCCACIGC 2340 

GAGGCCCACT GGGCACCTOC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAfi CAC3M3ACAGC 2400 
GGCCCCRTCC GGCASGCAGA TAACCAAGGT TTARCCKTAG GAATTCTGGT GRCCATCCTG 2460

TGICTTCTTG CTGCCGGATT TGTGGTTTAT CTCftAAAGGA AGACCTTGAT ACGACTGCTG 2520 

TTTACRAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCRCCC 2580 

OGTGGCTTCC AACCXTTOTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 

CCGCCAGATT CCTAOCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 

GACATCRGCA GACCCCTCAA CX3GCCIGAAT GTCCCTCAGC CCCRGTCAAC TCAGOSAGTG 2760 

CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGOSTCC OTGCCAGACC CCTGCCAGCC 2820 

AAGCCTGCAC TTAQGCAGGC CCa^GGGGACX: TGTAAGCCaA ACCOCCCTCA GASGCCTCTG 2880 

CCTGCftGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 

CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCRCAATA TCCACaCCSUV 3000 

GTGCCCAGAT CCACCCACRC CGCCTATATT AAGTGAGAAG CKGACACCTT TTTTCAACAG 3060 

TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAfiTTGGA GTTTTTTGIA CCAACTTTTA 3120 

GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCX3TCAGT 3180 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACRGGTA CTTGTAAATT ATTAATTTAT 3240 

GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 

CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 

ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420 

CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 

CTCAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 

TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600
ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG G 
AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA A 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG T 
GGTGCTTGAT AGAAATGCCA AGCRCTTCTT TTTCTCGCTG TCCTTTCTAG A 
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG T 
CTGCAAACCG CCACCTCCCT ATACTGCTTG G 



h ACCAGATCTA GTCAATCAAG T 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA T 

TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATIGGAC AAAGAACCTT CTCTATGGGG 4200 

CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 

ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACftGAATGTG ATAAGGAAAT 4320 

CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 

CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440

AATGGCATGA GARATACAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTOTGTCTT 4500 

CTGGACTGGT TTTCACRTTA GAAGACAATT GACAACAGTT ACRTAATTCA CTCTGAGTGT 4560 

TTTATGAGAA AGCCTTCTTT TGGC3GTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620

TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 

TCXCCACTGT ATCTAGGCAA CATAGTATTC ATGACIATGG ATAAACIAftA OiOSIGhikC 4740 

AAACACACAC AAAAlSGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 

TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 

AGATACATAC AGASTTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 

TATATACTAT TAAAAAGGTT TACftGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980 

AGATGCCCAA ATCX3TAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 



1 11 21 31 41 51 
KAARPLPVSP ARALLLALAG ALLAPCEAEG VSLWNEGRAD EWSASVRSG DLWIPVKSFD
SKNHPEVLNI RLQEESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NTTVILGHCY 
YHGHVEGYSD SAVSLSTCSG LRGLIVFENE SWLEPMKSA TNRYKLFPAK KLKSVRGSCX3 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVl VADHREFQRQ GKDLBKVKOE 
LIEIAMHVDK PYRPIiNIRIV LVGVEVWMDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMKH 
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFULPEVEE 
SFGGQKCGNR FVEEGEECDC GEPEECtWRC CNATTCTLKP DAVCAHGLCC EDOJLKPAGT 
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 
AKPAPGICFE RVNSAGDPYG NCX3KVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 
lETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVPGVHEC 
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 
LAAGFWYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG PQPCQAHLGH LGKGLMRKPP 
DSYPPKDNPR RLLQCQWVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 
ALRQAQGTCK PNPPQKPLPA DPLARTTELT HALARTPGQW ETGLHLAPIjH PAPQYPHQVP 



| | | | | |

GAGGAGGAGG GAAAAGGCGA GCAAAAAGGA AGAGTGGGAG GAGGAGGGGA Ar" 

GAGGAAGAGG AGGRGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGRGGT 120 

TAATACCAAG AACCATGTGT GCXX3AGCGGC TGGGCCAGTT CATGRCCCIG GCTTTGGTGT 180 

TGGCCACCTT TGACKCGGCG CXKK3GGACCG ACGCCACCAA CCCACCCGfifi GGTCCCCAAG 240

ACAGGAGCTC CCAGCAGAAA GGCCGCCTCT CCCTGC3M3AA TACAGCG6AG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GOSTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCITACAT GGGATTTGCA TGRCTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGAOS CCTTGAAATG TAAGGCCCftC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GOSCXSQCTGC CCAGGAGRAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT GACCTGTQGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGA6CA GAACTGGG6A AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 
AGARGCCTCC CACGGCGCCC CCXX3ASCGCC MCCCCAGGT GGACAGAACC AAGCTCTCCA 840

GGGCCCACCA CGGGGRAGCA GGACATCACC TCCCAGftGCC CRGCAGTAfiG GA«3RCTGGCX: 900 

GAGGTGCCAA GGGTGAGCGA GCTAGCSUIGA GCCRCCCSAA CGCCCRTGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCAG GGRCCTTCCG GAAGCAGOSA CTGGGAAGAC GAACXGTCTG 1020 

AGTATTCIGA TATCCX3GAGG TGAftATGAftA GGCCTGGCCA OGAftATCTTI CCTCCaCQCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT CTGTTCGCGG AACGGACBCG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGIGAATT CKKACrTAC 1260 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTOSTACCTT TGTCTTCTTT CXSiTCTGIGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGRGGTGAA CCAGGGAGGG GCAGGGtSlSG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAR GTCAGGGCXK3 CTGGRTTCCR GGACAGGABT 1500 

GAATGTfiAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCftGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCSi GTTTCTTGTG TGGGGCCTTG CXSGTTCAGftG 1620 

G AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680 

A CTGTCCAAGG GAGTGGCCCC TRTGRGTTTA TATTTTAACC 1740 

A ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATRTCTAC ATATCTGTCA 1800 

^ ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGQTT 1860 

A AAAACCAGCC CRTCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAGCTA TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GASAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220 

AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCC3M3ATCC TAATTGGAAA 2280 

TTTAGCAATA AGGAGAGGAG TCCAAGGGGA (JAAATAAAGG CAGAGAGAGA GAGAGAGAGA 2340 
GGGAGAGGAA GAAAAGAGAG RGAGAAAAGA GCCTCGTGCC 

Seq ID NO: 80 Protein sequence 
Protein Accession #: NP_003705

1 11 21 31 41 51 
MCAERLGQFM TIiALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60

GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIitDALKCK AHRLRHRPGC 120 

ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ BNTRVIVBMI HFKDLLLHBP YVDLVNLLLT 180 

CGEEVKEAIT HSVQVQCEQH WGSLCSILSF CTSAIQKPPT APPEEQPOVD RTKLSRAHHG 240 

EAGHHLPEPS SRETQRGAKG ERGSKSHPNA HARGRVG(a/3 AQGPSGSSEW EDEQSEYSDI 300 
RR 
AATCRTGTTT TTGGAATAAA AATGGTAAAT GTTTTTTTTT TTTTTTTTTT GTTAGTAATG 60

GAAGTATTCT ATTATTTTTA ATTTTATGTA TGTACSiGACA AGAGCTATAT GGGAAAGTAG 120 

CTATTACAGA COCATTATAG TTTCATCSTT TATAACCAGG GTTGGTAGTA AAGCAACAGA 180 

AATACATCTT CAATGCAGAC TTGCTCCCAG CATCTCTTTC CTTTCATCCT TGACCATATT 240 

GCATCCTGGA CRGGAAGAAG ACATCCTCAG CCCACCACCC CTTCCCCICC AATGTGGACT 300 

GAGTGTTGGG TCAACTGATA AGCRGTGCAG AOTCTTCATT TTGCTTTGTT GGTGCCTTGG 360

AGGCaTGTGA GGGCAOWCC TC3UUITTCCA GAGACCTTGC CTTCAGGATT TGAGGTGGGC 420 

RAAGATGTGA AAGAAGACCT TTAGCCACTC AGTGTTATCA AAAGCaU^TCA CCAAAGAAGT 480 

TTTTGCATCT GGCATTGTGT TGGGACGTCC ATCAGCAGTA CCGTACGGTC TCTTTACTGG 540 

ACAAGTATTT ACRCCTTGTT AACAGCAGTG TACAOCCGGT GGRAAACTCC TATCTAATGG 600 

CATGGCAGAA TTGTAAACAT TTCCTCCTTC RCCAARTTTG TGGATGGAAT AAACATGTAG 660 

CCTTAAACTT TATTGGCTTC TCCGGTCTTG CCACTGCGGG GCCCAAAAAC TTTCAGAAGI 720 

AGGAATCCTG TGGCATCTGC TAATGGTGAT TGGAAGAAAG AGTAGAGGTG ACAGTATCTA 780 

TGGGTGCAGT ACCAGAACTR TTACGAAATG TTCCAGCTGC GATTTCRGAG GAATCCCCCC 840 
TGACCCCTGG ACGTGGTTCT CCTATTTCAG TCACCCTGTG COG 

Seq ID NO: 82 DNA sequence

Nucleic Acid Accession #: XM_061091.1 
Coding sequence: 1..2481

I i' r r i' i' 

ATGCCAAATA CTTCAGGAAC AACCAGGATT GAAATTTGGC TTCTCCRAGA GCCGCCCGOG 60 

CACCGAGCGC TGGTCGCCGC TCTCCTTCCG GTGAGTCCCA GCCCCGAGTT GGCTCTGGCG 120

CCCGGGTRCC CGCCRGTGCC GGCTGCCGAT GACOGATTCA CGCTCCCGAT GATTGGAGGT 180 

CAGATGCATG GTGAGAAGGT AGATCTCTGG AGCCTTGGTG TTCTTTGCTR TGAATTTTTA 240

GTTGGGAAGC CTCCTTTTGA GGCAAACGAA GTCCATGTAA GCAAAGAAAC CATCGGGAAG 300

ATTTCAGCTG (XAGCAAAAT GATGTQGTGC TCGGCTGCXG TGGAC34TCAT GTTICTGITA 360 

GATGGGTCTA ACAGCGTCGG GARAGGGAGC TTTGAAAGGT CCAAGCACTT TGCCATCRCA 420

GTCTGTGACG GTCTGGACAT CAGCCCCBAG AGGGTCAGAG TGGGAGCATT CCRGTTCRGT 480 

TCCaCTCCTC ATOTGGAATT CCXICTTGGAT TCATTTTCRA CCCAACRGGR AGTGARGGCA 540 

AGAATCAAGA GGATGGTTTT CRAAGGAGGG CGCACGGBGA CGGAACTTGC TCTGAAAIAC 600 

CTTCTGCACA GAGGGTTGCC TGGAGGCAGA AATGCTTCTG TGCCCCAGAT CCTCATCATC 660 

GTCaCTGATG GGAAGTCCCA GGGGGATGTG GCRCTGCCAT CCAAGCAGCI GARGGAAAGG 720 

GGTGTCACTG TGTTTGCTGT GGGGGTCAGG TTTCCCAGGT GGGAGGAGCT GCaTGCACTG 780 

GCCAGCGAGC CTAGAGGGCA GCACGTGCTG TTGGCTGAGC AGGTGGAGSA TGCCaCCHAC 840 

GGCCTCTTCR GCACCCTCAG CRGCTCGGCC ATCTGCTCCA GCGCCACGOC AGCTGGGRGC 900 

CCCGAGCTTG TCTTCATGGA GCGGTTAATG GGCATCTCTC TGATAGGCCC CTGTGACICG 960 

CAGCCCTQCC AQAATGGAGG CRCATGTGTT CC3M3AAGGAC TG6ACGGCTA CCRGTGCCTC 1020 

TGCCCGCTGG CCTTTGGAGG GGAGGCTAAC TGTGCCCTGA AGCTGAGCCT GGRATGCRGG 1080 

GTCGACCTCC TCTTCCTGCT GGACRGCTCT GCGGGCACCA CTCTGGACGG CTTCCTGOSG 1140 

GCCAARGTCT TCGTGAAGCG STTTGTGCGG GCCGTGCTGA GCGAGGACTC TCGGGCCOGA 1200 

GTGG6TGTGG CCRCATACRG CRGGGRGCTG CTGGTGGCG6 TGCCTGTGGG GGAGTACCAG 1260 

GATGTGCCTG ACCTGGICTG GAGCCTCGAT GGCATTCCCT TCCGTGGTG6 CXTCRCCCTG 1320

ACGGGCftGIG CrTTGOSGCa GGOSGCftGAG CGTGGCTTCG GGRGOSCCAC CAGQiCAGGC 1380 

CAGGRCCGGC OUaSTAGAGT GGTGGTTTTG CTCACTGAGI CACaCTCCGA GGATGAGGTT 1440 

GCGGGCCCAG CXSCGTOVCGC AAGGGCGOa GAGCTGCTCC TGCTGGGTGT AGGCRGTGAG 1500 

GCOSTGCGGG CAGAGCTGGA GGAGATCACA GGCAGCCCAA AGCATGIGAT GGTCTACTCG 1560 

GATCCTCAGG ATCTGTTCAA CCaiAATCXKT GAGCTGCAGG G6AAGCTGTG CAGCCGGCAG 1620 

CGGCCAGGGT GCOGGACACR AGCCCTGGAC CTCGTCTTCA TGTTGGACaC CTCTGCCTCA 1680 

GTAGGGCCCG AGAATTTTGC TCAGATGCA6 AGCTTTGTGA GARGCTGTGC CCTCCRGTTT 1740 

GAGGTGAACC CTGACGTGAC ACAGGTCGGC CTGGTGGTGT ATGGCAGCCA GGTGCAGACT 1800 

GCCTTCGGGC TGGACACCAA ACCCACCCGG GCTGCXSATGC TGCBGGCCAT TAGCCAGGCC 1860 

CCCTACCTAG GTGGGGTGGG CTCAGCCGGC ACCGCCCTGC TGCaCATCTA TGACRAASTG 1920 

ATGACCGTCC AGAGGGGTGC CCGGCCTGGT GTCCCCAAAG CTGTGCSTGGT GCTCACAGGC 1980 

GGGAGAGGCG CAGAGGATGC AGCCGTTCCT GCCCAGAAGC TGAGGAACAA TGGCATCTCT 2040 

GTCTTGGTCG TGGGCGTGGG GCCTGTCCTA AGTGAGGGTC TGOGGAGGCT TGCaGGTCCC 2100 

CGGGATTCCC TGATCCACGT GGCAGCTTAC GCCGACCIGC GGTACCACCSi GGACGTGCTC 2160 

ATTGAGTGGC TGTGTGGAGA AGCCAAGCAG CCAGTCAACC TCTGC3iAACC CAGCCCGTGC 2220 

ATGAATGAGG GCAGCTGCGT CCTGCAGAAT GGGAGCTACC GCTGCAAGTG TCGGGATGGC 2280 

TGGGAGGGCC CCCRCTGCGA GAACCGTGAG TGGAGCTCTT GCTCTGTATG TGTGAGCCaG 2340 

GGATGGATTC TTGAGACGCC CCTGAGGCAC ATGGCTCCCG TGCAGGAGGG CAGCAGCCGT 2400 

ACCCCTCCCA GCAACTACAG AGAAGGCCTG GGCACTGAAA TGGTGCCTAC CTTCTGGAAT 2460 



DGSNSVGKGS FERSKHFAIT VCDGLDISPB RVRVGAPQFS STPHLEFPLD SFSTQQEVKA 
RIKEMVFKGG RTETELALKY LLHRGLPGGR NASVPQILII VTDGKSQGDV ALPSKQLKER 
GVTVFAVGVR FPRWEELHAL ASEPRGQHVL lAEQVEDATN GLFSTLSSSA ICSSATPAGS 
PELVFMERLM GISLIGPCDS QPCQNGGTCV PEGLDGifQCL CPLAFGGEAN CRLKLSLECR 
VDLLFLLDSS AGTTLDGFLR AKVFVKRFVR AVLSEDSRAR VGVATYSREL LVAVPVGEYQ 
DVPDLVWSLD GIPFRGGPTL TGSALRQAAE RGFGSATRTG QDRPRRWVL LTESHSEDEV 
AGPARHAEAR ELLLLGVGSE AVRAELEEIT GSPKHVMVYS DPQDLFNQIP ELQGKLCSRQ
RPGCRIQALD LVFMLDTSAS VGPENFAQMQ SFVRSC3UiQF EVNPDVTQVG LWYGSQVQT 
AFGLDTKPTR AAMLRAISQA PYLGGVGSAG TALLHIYDKV MTVQRGARPG VPKA\AA;lTG 
GRGAEDAAVP AQKLENNGIS VLWGVGPVL SEGLRRLAGP EDSLIHVAAY ADLRYHQDVL 
lEWLOGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCEDG WEGPHCENRE HSSCSVCVSQ 
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCaPGP 



I I I I I I 

ATGCCCCCTT TOCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TDCAGGAAGT CCATGTAAGC ASAGAftACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATCA TGIGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACSGT CTGTGRCGGT 240 

CTGGRCATCR GCCCCGRGAG GGTCAGAGTG GGAGCSTTCC AGTTCAGTTC CACTCCTCAT 300 

C ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATCAAGAGG 360 

h AfiGGAGGGCG CACX3GAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

CCCCAGATCX: TCSTCATCGT CACTGATGGG 480 

C ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCRCCAACGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CX3AGGCTCAC 720 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 780 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 840

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 1020

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260

CTGACGGGCA GIGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 1320 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 1380 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440

GAGGCOGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500 

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GAOCTCGTCT TCATGTTGGA CACXMCTGCC 1620 

TCAGTAGGGC CSMAGAATTT TGCTC3«aiTG CAfiftGCTTTG TGASAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740

ACTGCCTTCG GGCTGGACAC CSAACCCACC CGQGCTGCGA TGCTGCGGGC CATTAfiCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACOGCKC TGCTGCaCAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 1920 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAR CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040

CCCCGGGATT CCCIGATCCA CGTGGCAGCI TACX3CCGACC TGCGGTACCA CCAGGACGTG 2100 

CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCaGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG OGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 2280 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 2340
TACCTTCXGG



I I I I I I 

MFPFLLLEAV CVFLFSRVFP SLPLQEVHVS KBTIGKISAA SKMMWCSAAV DIMFLLDGSH 
SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQBVKARIKR 
HVFKGGRTET ELALKYLLHR GLPGGHNASV PQILIIVTDG EtSQGDVALPS KQLKBSGVTV 
FAVGVRFPRW BBLHftLASEP RGQHVLLAEQ VEDATNGLFS TliSSSAICSS ATPDCRVEflH 
PCBHRTLEMV RBFAGNAPCW RGSRRTLAVL AAHCPFYSMK RVFLTHPATC YRTTCPGPCD 
SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLIiDS SAGITIiDGFL 
EAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 
LTGSALRQAA ERGFGSATRT GQDRPRRWV LLTESHSBDE VRGPAEHARA RELLLLGVGS 
EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKIjCSR QRPGCRTQAIi DLVEMLDTSA 
SVGPENFRQM QSFVRSCALQ PEVNPDVTQV GLWYGSQVQ TAPGLDTKPT HAAMLRAISQ 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGABDAAV PAQKljRNNGI 
SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YSDLRYHQDV LIEWLCGEAK QPVNLCKPSP 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENH EWSSCSVCVS QGWILETPLH HMftPVQBGSS 
RTPPSNYREG L 




G TGAGTAGAGC CGCCCGGGCA CCGAGCGCTG 
r ATATCAACAT GCCCCCTTTC CTGTTGCTOG AAGCCGTCTG 
TGTTTTCCTG TTTTCCAGAG TGCCCCCATC TCTCCCTCTC CAGGAAGTCC ATGTAAGCAA 
AGAAACCATC GGGAAGATTT CAGCTGCCAG CAAAATGATG TGGTGCTCSG CTGCAGTGGA 
3 GGTCTAACAG CGTCGGGAAA GGGAGCI 
C ATCACAGTCT GTGACGGTCT GGACATCAGC CCCGAGAGGG T 
G TTCAGTTCCA CTCCTCATCT GGAATTCCCC TTGGATTCRT TTTCAACCCA 
ACAGGAAGTG AAGGCAAGAA TCAAGAGGAT GGTTTTCAAA GGAGGGCGCA CGGAGACGGA 
ACTTGCTCTG AAATACCTTC TGCACAGAGG GTTGCCTGGA GGCAGAAATG C ' 
CCAGATCCTC ATCATCGTCA CTGATGGGAA GTCCCAGGGG GATGTGC 
GCAGCTGAAG GAAAGGGGIG TCACTGTGTT TGCTGTGGGG GTCRGGTTTC C 

GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720 

GGAGGATGCC ACC3iACX3GCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGCGC 780 

CACGCCAGAC TGCAGGGKXS AGGCTCACCC CTGTGAGCAC AGGROSCTGG AGATQGTCCG 840 , 

GGfiGTTCGCT GGCAATGOCC CATGCTGGRG AGGATCGOSG CGGRCCCTTG CGGTGCTGGC 900 

TGCACACTGT CCCTTCTACA GCTGGAAGRG AGTGTTCCTA ACCCACCCIG CCACCTGCTA 960 

C3«3GACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 1020 

TCCAGARGGA CTGGACGGCT ACCAGTCCCT CTGCCCGCTG GCCTTTGGAG GGGftGGCTAA 1080 

CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140 

TGOGGGCACC ACTCTGGACG GCTTCCTGOS GGCCAAAGTC TTCGTGAAGC GGTTTCTGOG 1200 

GGCOGTGCTG AGCGAGGRCT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

GCTGGTGGCG GTGCCTGIGG GGGAGTACCA GGRTGTGCCT GACCIGGTCT GGAGCCTOGA 1320 

TGGCATTCCC TTCCGTGGTG GCCCCACCCT GRCGGGCAGT GCCTTGOGGC AGGOSGCAGA 1380 

GCX3TGGCTTC GGGfiGOGCCA CCAGGACAGG CCAG6ACCGG CCACGTASfiG TGGTGGTTTT 1440 

GCTCACTGAG TCACACTCCG AGGATGAGGT TGOGGGCKCA GOSCXSTCACG CAAGGGOGOG 1500 

AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGRTCAC 1560 

AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620 

TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCOTGGA 1680 

CCTCGTCTTC ATGTTGGRCA CCTCTGCCIC AGTAGGGCCC GAGAATTTTG CTCAGATGCA 1740 

GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800 

CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA AACCCACCCG 1860 

GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 1920

AGAGGGGTG CCCGGCCTGG 1980 

CAGAGGATG CAGCCGTTCC 2040 
2100 

AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG TGGCAGCTTA 2160 

CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCCAAGCA 2220

GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 2280

TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG AGAACCGATT 2340

CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 2400

AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460 

CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC ACTGAGGGfiG 2520 

GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2580 

AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGftG GATAAACAAG GGGTCCTGAA 2700 

GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTOGCC AGAATGTTGT 2760 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGACGG 

Seq ID NO: 87 Protein sequence 
Protein Accession #: Eos sequence

1 11 21 31 41 51 

I I I I I I 

MPPFLLLEAV CVFLFSHVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKAHIKE 120 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 240 

PCBHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 300 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 360 
RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVEDLVWSL DGIPFRGGPT
LTGSALRQAA ERGFGSATET GQDRPRRWV LLTESHSKDE VAGPfiEHRRA REIiLIiLGVGS 
EAVHAELEEI TGSPKHVMVY SDPQDLPNQI PELQGKLCSR QRPGCSTQAL DLVPMLDTSA 
SVGPENPAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 
APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI
SVLWGVGPV LSEGLRRIAG PRDSLIHVAA Y7U)LRYHQDV LIEWLCGEAK QPVNLCKPSP 
CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR FLRRP 

Seq ID NO: 88 DNA sequence
Nucleic Acid Accession #: NM_019894

1 11 21 31 41 51 

I I I I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT C - - - 

AAACCCCGTA TCCCCATGGA GACCTTC

CTGAGCCTGG CGAGTATCAT CATTGTGGTT G 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC ATCCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACfi 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATrGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720

G CCCACTGCTT CAQQAAACAT SCCGATGTGT TCAACTGGAA GGTGCGGGCA 780 



TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA G 

ACTTTCTCAG GCACAGTCAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGGATCAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020

GACATACTGC TGCAGGCGTC AGTCCAGGTC ATTGACAGCA CACGGTGC31A TGCAGACGftT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCOGGA AGGGGGTGTG 1140 

GACACCTGCC AQGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCftCCCCSGG AGTATACACC 1260 

AAGGTCTCAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA

Seq ID NO: 89 Protein sequence
Protein Accession #: NP_063947.1

1 11 21 31 41 51

MLQDPDSDQP LNSLDVKPLR KPRIPMETPR KVGIPIIIAL LSLASIIIW VLIFWILDKY 60 

YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVIiDSAT 120 

GNWFSACFDN PTEALAETAC RQMGYSSKPT FEAVEIGPDQ DLDWEITEM SQEUiMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGBE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFHKH TDVFNMKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDHD lALMKLQFPL 300

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCHADD 360 

AYQGBVTBKM MCAGIPBGGV DTCQGDSGGP LMYQSDQWHV VGIVSMGYGC GGPSTPGVYT 420 
KVSAYLNMIY NVWKAEL 

Seq ID NO: 90 DNA sequence

Nucleic Acid Accession #: NM_002776.1
Coding sequence: 82.. 912 

1 11 21 31 41 51

I I I I I I

ACCAGCGGCA GACCACAGGC AGGGCAGAGG CAC33TCTGGG TCCCCTCCCT CCTTCCTATC 60

GGCGACTCCC AGATCCTGGC CATGAGAGCT CCGCACCTCC ACCTCTCCGC CGCCTCTGGC 120 

GCCCGGGCTC TGGCGAAGCT GCTGCCGCTG CTGATGGCGC AACTCTGGGC CGCAGAGGCG 180 

GCGCTGCTCC CCCAAAACGA CACGCCCTTG GACCCCGAAG CCTATGGCGC CCOGTGCGCG 240 

CGOGQCTCGC AGCCCTGGCA GGTCTCGCTC TTCAACGGCC TCTCGTTCCA CTGCGCGGGT 300 

GTOCTGGTGG ACCAGAGTTG GGTGCTGACG GCCGCGCACT GCGGAAACAA GCCACTGTGG 360

GCTCGAGTAG GGGATGATCA CCTGCTGCTT CTTCAGGGCG AGCAGCTCCG CCGGACBACT 420 

CGCTCTGTTG TCCATCCCAA GTACCACCAG GGCTCAGGCC CCATCCTGCC AAGGCGAACG 480 

GATGAGCACG ATCTCATGTT GCTAAfiGCTG GCCAGGCCCG TAGTGCCGGG GCCCCGCGTC 540 
CGGGCCCIGC AGCTTCCCTA CCGCTGTGCT CAGCCCGGAG A 

TGGGGCACCA CGGCCGCCCG GAGAGTGAAG TACAACAAGG GCCTGACCTG C

ACTATCCTGA GCCCTAAAGA GTGTGAGGTC TTCTACCCTG GCGTGGTCAC CAACAACATG 720 

ATATGTGCTG GACTGGACCG GGGCCAGGAC CCTTGCCAGA GTGACTCTGG AGGCCCCCTG 780

GTCTGTGACG AGACCCTCCA AGGCATCCTC TOGTGGGGTG TTTACCCCTG TGGCTCTGCC 840 

CAGCATCCAG CTGTCTACAC CCAGATCTGC AAATACATGT CCTOSATCAA TAAAGTCATA 900 

CGCTCCAACT GATCCAGATG CTACGCTCCA GCTGATCCAG ATGTTATGCT CCTGCTGATC 960

CAGATGCCCA GAGGCTCCAT OGTCCATCCT CTTCCTCCCC AGTC3GGCTGA ACTCTCCCCT 1020 

TGTCTGCACT GTTCAAACCT CTGCCGCCXTT CCACACCTCT AAACATCTCC CCTCTCaCCT 1080 

CATTCCCCCA CCTATCCCCA TTCTCTGCCT GIACTGAAGC TGAAATGCAG GAAGTGGTGG 1140 

CRAAGGTTTA TTCCAGAGAA GCCAGGASGC CGGTCATCAC CCAGCCTCTG AGAGCfiGTTA 1200 

CTGGGGTCAC CCRACCTGAC TTCCTCTGCC ACTCCCCXSCT GTGIGACTTT GGGCAAGCCA 1260

AGTGCCCTCT CTGAACCTCA GTTTCCTC31T CTGCAAAATG GGAACaVATGA CGTGCXTTACC 1320 

TCTTAGACAT GTTGTGAGGA GACTATGATA TAACATGTGT ATGTAAATCT TCATGTGATT 1380 

GTCATGTAAG GCTTAACaCA GTGGGTGGTG AGTTCTGACT AAAGGTTACC TGTTGTCGTG 1440 
ARAAAAAAAA AAAA 

Seq ID NO: 91 Protein sequence
Protein Accession #: NP_002767.1 

1 11 21 31 41 51

I I I I I I 

MRAPHLHLSA ASGARALAKL LPLLMAQLWA AEAALLPQND
VSLFNGLSFH CAGVLVDQSW VLTAAHCX3NK PLWARVGDDH LLLLQGEQLR RTTRSWHPK
YHQGSGPILP ERTDEHDLML LKLARPWPG PRVHALQLPY RCAQPCiDQCQ VAGWGTTAAR 
RVKYNKGLTC SSITILSPKB CBVFYPGWT NNMICAGLDR GQDPCQSDSG GPLVCDETLQ 
GILSWGVYPC GSAQHPAVYT QICKYMSWIN KVIRSN 



GTMGGTCTC TGAGATCCTT GCACTAGCTA CATCCTCRGG GTAGGAGGAA 
GATGGCTTCX: AGRAGCRTGC GGCTGCTCXTT ATTGCTGAGC TGCCTGGCCA AflACAGGAGT 
CCTGOGTGAT ATCAT(aTGA GACCCAGCTG TGCTCCTGGA TGGTTTTACC ACASGTCCAA 
TTGCTATGGT TACTTCaGGA AGCTGAGGAA CTGGTCTGAT GCCGAGCTCG AGTCTCAGTC 
TTACGGAAAC GGAGCCCACC TGGCATCTAT CXITGAGTTTA AAGGAAGCCA GCACCaTAGC 
AGAGTACATA AGTGGCTATC AGAGAAGCCA GCCGATATGG ATTGGCCTGC ACXSACTCACR 
GAAGAGGCAG CAGTGGCAGT GGATTGATGG GQCCATGTAT CTGTACAGAT CXTGGTCTGG 
CAAGTCCATG GGTGGGAACA AGCACTGTGC TGAGATGAGC TCC3UITAACA ACTTTTTAAC 
TTGGAGCAGC AACGAATGCA ACAAGCGCCA ACACTTCCTG TGC3UU3TACC GACCATAGAG 
GATTCTGCTA ACTCCTGCAC AGCCCCGTCC TCTTCCTTTC TGCTAGCCTG 
CTCATTATTT CAGAGGGGAA ACCTAGCAAA CTAAGAGTGA TAAGGGCCCT 
ACTACACTGG CTTTTTTAGG CTTAGAGRCA GAAACTTTAG CATTGGCCCA GTAGTGGCTT 
CTAGCTCTAA ATGTTTGCCC CGCCATCCCT TTCCACAGTA TCCTTCTTCC CTCCTCCCCT
GTCTCTGGCT GTCTCGAGCA GTCTAGAAGA GTGCATCTCC AGCCTATGAA ACAGCTGGGT 
CTTTGGCCAT AAGAAGTAAA GATTTGAAGA CAGAAGGAAG AAACTCAGGA GTAAGCTTCT 
AGACCCCTTC AGCTTCTACA CCCTTCTGCC CTCTCTCCAT TGCCTGCACC CCACCCCAGC 
CACTCAACTC CTGCTTGTTT TTCCTTTGGC CATAGGAAGG TTTACCAGTA GAATCCTTGC 
TAGGTTGATG TGGGCCATAC ATTCCTTTAA TAAACCATTG TGTACATAAG AAAAAAAAAA

Seq ID NO: 93 Protein sequence



MASRSMRLLL LLSCLAKTGV LGDIIMRPSC APGWFYHKSN CYGYFRKUUI WSDAELECQS 60 

YGNGAHLASI IiSLKERSTIA EYISGYORSQ PIWIGIiHDPQ KRQQWQHIDG SMYLYRSWSG 120 
KSHGGHKHCA EMSSNNNFLT WSSNECNKEQ HFLCKYRP 

Seq ID NO: 94 DNA sequence

Nucleic Acid Accession #: XM_051860

Coding sequence: 1..4086 

1 11 21 31 41 51 

GAGCTAGCGC TCSiAGCAGAG CCCAG03CGG TGCTATCGGA CaGAGOCTGG CGAGCGCMG 60 

OGGCGCGGGG »GCX»GCGGG GCTGAQCXiCG GOaGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT COGCTTGCCC ACGCCOOSGG AGCTOGCGGC GCCTGGCGGT CAGCX3ACCAG 180 

ACGTOOSGGG CCGCTGCGCT CCTGGCXCGC GAGGCGTGAC ACTGTCTOGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGRGCTG CTGGGAG§CA GGACTTCCTC TTCRAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCXTCG GGCCACATCC ACAGTGGCTG 360 

CTGQGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCIGCTCAC CICTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CXaiCGACGAG CXX3ATTGTTT 540
TGCGAACCCG GCACaTCCTG ATTGACRACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 
GCCCTTTCCA GGGCMTTTC ACCATCATTT TGTATGGAAG QGCTGATGAA GGTATTC3K3C 660 
CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCXSCT CTTGAGTTGC 720 
ATGGACAGAA AAAGCTCTCC TGOACSTTTC TGAACARGAC CCTTCaCCCA GGTGGCATGG 780 
CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 840 
TCGACCCCAA ATCAGGC»CA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCMGA 900 
AAGAGAGTGA AOGTCTGGTC CaGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 950 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTX3GAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGQGTC 1920 

AGTACCCGAT TCACTTCCAC CTGG(XGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2040 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 



AGTCIGQAAC CCTCCTCCCC TCGGACCGTG ACAGCaAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTOCTACCC GGGGTACATC CCCRAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTOT 2280 

GGATGGCCaA TCCCAAt3iAC AACCTCATCA ACTGTGCOSC TGCAGGATCT GAGGAAACTG 2340 

GATTTPGGTT TATTTTTCAC CACGTACCAA CGGGOCCCTC CGTGGGAATG TACTCCCCAG 2400 
GTTATTCAGA GCRCATTCX3V CTGGGAARAT TCTATAACAA CCGRGCRCAT TCCAACTACC 2460

GGGCTC3GCAT GATCATAGftC AACGGAGTCA AAACCaCOGA GGCCTCTGCX: AAGGACASGC 2S20 

GOCCGTTCCT CTCAATCATC TCTGCCAGftT ACAGCCCTCa CCRGGACGCC GACCCGCTGR 2S80 

AGCCCCX3GGA GCCGGCCATC ATCSGACACT TCATTGCCTA CSU«3AACCAG GACCACGGGG 2640 

CCTGGCTGCG OGGCGGGGAT GTGTGGCIGG ACAGCTGCCG GTTTGCTGAC AATGGCATTG 2700 

GCCTQACCCT GGCCAGTGGT GGAftCCTTCC CX3TATGACGA CGGCTCCaAG CAAGAGATAA 2760 

AGAACAGCTT GTTTGTTGGC GAGRGTGGCA ACGTGGGGAC GGAAATGATG GACAATAGGA 2820 

TCTGGGGCCC TGGCGGCTTG GACCATAGCG GAAGGACCCT CCCTATAGGC CAGAATTTTC 2880 

CAATTAGAGG AATTCAGTTA TATGATGGCC CCATCAACAT CCAAAACTGC ACTTTCKGAA 2940 

AGTTTGTGGC CCTGGAGGGC CGGCACACCA GOGCCCTGGC CTTCCGCCTG AATAATGCCT 3000 

GGCAGAGCTG CCCCCATAAC AACGTGACCG GCATTGCCTT TGAGGACGTT CCGATTACTT 3060 

CCAGAGTGTT CTTCGGAGAG CCTGGGCCCT GGTTC3«ICC3^ GCTGGACaTG GATGGGGATA 3120 

AGACATCTGT GTTCCATGAC GTCGACGGCT CX«TGTCOGR GTACCCTGOC TCCTACOTCR 3180 

CGAAGAATGA CAACTGGCTG GTCCGGCACC CAGAOTGCAT CAATGTTCCC GACTGGAGAG 3240 

GGGCCATTTG CAGTGGGTGC TATGCACAGA TGTACATTCA AGCXTTACSUIG ACCAGTAACC 3300 

TGCGAATGAA GATCATCAAG AATGACTTCC CCAGCCACCC TCTTTAOCTG GAGGGGGCGC 3360 

TCACCAGGAG CACCCATTAC CAGCAATACC AACCGGTTGT CACCCTGCAG AAGGGCTACa 3420 

CCATCCACTG GGACCAGACG GCCCCCGCCG AACTCGCCAT CTGGCTCSiTC AACTTCAACA 3480 



CGATCCCCAA GAAGCTCTTT GGTTCTCAGC TGAAAACAAA GGACCATTTC TTGGAGGTGA 
AGATGGAGAG TTCCAAGCAG CACTTCTTCC ACCTCTGGAA CGACTTCGCT T 
TGGATGGGAA GAAGTACCCC AGTTCGGAGG ATGGCATCCA G 
ACCAAGGGCG CGTGGTGAGC CACACGAGCT TCAGGAACTC CATTCTGCAA G 

GGCAGCTTTT CAACTATGTG GCGACC»TCC CTGACAATTC CATAGTGCTT ATGGCATCAA ^ 

AGGGAAGATA CGTCTCCAGA GGCCCATGGA CCAGAGTGCT GGAAAAGCTT GGGGCAGACA 4200 

GGGGTCTCAA GTTGAAAGAG CAAATGGCAT TCGTTGGCTT CAAAGGCAGC TTCCGGCCCA 4260 

TCTGGGTGAC ACTGGACACI GAGGATCACA AAGCCAAAAT CTTCCAAGTT GTGCCCATCC 4320 

CTGTGGTGAA GAAGAAGAAG TTGTGAGGAC AGCTGCCGCC CGGTGCCACC TCGTGGTAGA 4380 

CTATGACGGT GACTCTTGGC AGCAGACCAG TGGGGGATGG CTGGGTCCCC CAGCCCCTGC 4440 

CAGCAGCTGC CIGGGAAGGC CSIGTTTCAG CCCTGATGGG CCAAGGGAAG GCTATCAGSG 4500

ACCXrPGGTGC TGCCaCCTGC CECTACTCAA GTGTCTACCT GGAGCCXCTG GGGCGGTGCT 4S60 

GGCCAATGCT GGAAACATTC ACTTTCCTGC AGCCTCTTGG GTGCTTCTCT CCTATCTGTG 4620 

CCTCTTCaGT GGGGGTPTGG GGACCATATC AGGAGACCTG GGTTGTGCT6 AOWSCAAAGA 4680 

TCCACTTTGG CAGGAGCCCT GACCCAGCTA GGAGGTAGTC TGGAGGGCTG GTCATTCACA 4740 

GATCCCCATG GTCTTCAGCA GACAAGTGAG GGTGGTAAAT GTAGGAGAAA GAGCCTTGGC 4800 

CTTAAGGAAA TCTTTACTCC TSIAAGCAAG AGCCRACCTC ACAGGATTAG GAGCTGGGGT 4860 

AGAACTGGCT ATCCTTGGGG AAGAGGCAAG CCCTGCCTCT GGCCGTGTCC ACCITTCAGG 4920 



AT GGTACCTGGA ACCCAACAGT TCATGGATAT 5040

CX»CTGATAT CCATGATGCT GGGTGCKCXa^ GCGCACAOSG GATGGAGAGG TGAGAACTAA SI 00 

TGCCTAGCTT GAGGGGTCTG CAGTCCaGTA GGGCAGGCAG TC3«3GTCCAT GTGCACTGCai S160 

ATGCCAGGTG GAGAAATCAC AGRGAGGTAA AATGGAGGCC AGTGCC»TTT CAGAGGGGAG 5220

GCTCAGGAAG GCTTCTTGCT TACAGGAATG AAGGCTGGGG GCATTTTGCT GGGGGGAGAT 5280 

GAGGCAGCCT CTGGAATGGC TCAGGGATTC AGCCCTCCCT GCOGCTGCCT GCTGAAGCTG 5340

GTGACTACGG GGTCGCCCTT TGCTCACGTC TCTCTGGCCC ACTCATGA7G GAGAAGTGTG 5400 

G AGCAATGGGC TTTGCTGCTT ATGAGCACAG AGGAATTCAG TCCCCAGGCA 5460 



CATTTGCTCT TCATCCAGGG AACTGAGCAC AGGGGGCCIC CAGGAGACCC T 

T CGGCCTGGGA TTTCAGAGCT GGAAATATAG AAAATATCTA GCCCAAAGCC 
i CAGATGGGGA AAGTGAGCCC CCAAGATGGG A 
GACGGCCTGG GGAGCCCCAC CCTAGCCCTT GCTGCCACAC CACATTGCCT C 

lAGGTAG CTTCTGGAAA TGGGGACAAG TCCCCTCGAA : 
A TGACTAGAGT AGAATGACAG CTAGCAGATC TCTTCCCTCC TGCTCCCAGC ! 
GCACACAAAC CCGCCCTCCC CTTGGTGTTG GCGGTCCCTG TGGCCTTCAC TTIGTTCACT I 
ACCTGTCAGC CCAGCCTGGG TGCACAGTAG CTGCAACTCC CCATTGGTGC TACCTGGCTC ' 
TCCTGTCTCT GCAGCTCTAC AGGTGAGGCC CAGCAGAGGG AGTAGGGCTC GCCATGTTTC ( 
TGGTGAGCCA ATTTGGCTGA TCTTGGGTGT CTGAACAGCT ATTCGGTCCA CCCCAGTCCC 
TTTCAGCTGC TGCTTAATGC CCTGCTCTCT CCCTGGCCCA CC7TATAGAG AGCCCAAAGA 
A GAGGGAGAAC TCTATCTGTG GTTTATAATC TTGCACGAGG CACCAGAGTC 

CCTTCAAAGA GGGCCTGCCT GGCTCCCTCC ACCCAACTGC ACCCATGAGA C 
GAGTCCATTC CCCAGGTGGG A 
AGCTGCTGGG AGGTGACCAT AGGGCTCTGC T 
AGTCACAGGA AGGACTTCTT CCAGGGAGAT TAGTGGTGAT Q 
CXTTCATGTCC TTCTTGTCCA CGGTTTTGTT GAGTTTTCAC TCTTCTAATG CAAGGGTCTC 6600 
ACACTGTGAA CCACTTAGGA TGTGATCACT TTCAGGTGGC CAGGAATGTT GAATGTCTTT 6660 
GGCTCRGTTC ATTTAAAAAA GATATCTATT TGRAAGTTCT CAGAGTTGTA CATATGTTTC 6720 
ACAGTACAGG ATCTGTACAT AAAAGTTTCT TTCCTAAACC ATTCACCAAG AGCCAATATC 6780 
TAGGCATTTT CTTGGTAGCA CAAAITTTCT TATTGCTTAG AAAATTGTCC TCCTTGTTAT 6840 
TTCTGITTGT AAGACTTAAG TGAGTTAGGT CTTTAAGQAA AGC3«CGCTC CTCTGRAATG 6900 
C GAAATAGCTG GTCCTTTTTC GGGAGTTAGA TGTATAGAGT 6960 
r TGTAGGCATC ACCATGAACA AAGATATATT TTCTATTTAT 7020 
TTATTATATG TGCACTTCM GAAGTCACIG TCAGAGRAAT AAAGRATTGT CTTAAATGTC 

Seq ID NO: 95 Protein sequence
Protein Accession #: XP_051860.2 
TIILYGRADB GIQPDEYYGL KYIGVGKGGA LBLHGQKKLS WTFLHKTLHP GGMftEGGXFF 180
BRSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYIiNAVPDGR ILSVAVNDB6 240 
SRNLDraiAEK AMTKLGSKHF LHLGFRHPWS FLTVKGHPSS SVEDHIEYHG HRGSAAAHVF 300 
KLFQTBHGBy FNVSLSSEWV QDVBWTBWFD HDKVSQTKGG BKISDLWKAH PGKICNRPID 360 
IQATTMDGVN LSTBWYKKG QDYRFACTTOR GRACaiSYRVE FLCGKPVRPK LTVTIDTNVN 420 
STILNLEDNV QSWKPGDTLV lASTDYSMYQ AEEFQVLPC31 SCAPNQVKVA GKPMYLHIGE 480 
EIDC3VDMRAE VGLLSRNIIV MGBMEDKCYP YENHICNFFD FDTPGGHIKF ALGFKAAHLB S40 
GTELKHMGQQ IiVGQYPIHFH LAGDVDBRGG TOPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 
KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTIiP SDRDSKMC3CM ITEDSYPGYI 660 
PKPRQDCNAV STPWMSNPNN HLINCAAAGS BBTGFWFIFH HVPTGPSVGM YSPGYSEHIP 720 
LGKFYNNRAH SNYRAGMIID HGVKTTBASA KDKRPFLSII SARYSPHQDA DPLKPRBPAI 780 
IRHFIAYKNQ DHGAWLRGGD VWLDSC3JFAD NGIGLTLASG GTFPYDDGSK QBIKNSLFVG 840 
ESGMVG'fEMM DNRIWGPGGL DHSGRTLPIG QNFPIRGIQL YDGPINIQNC TFRKPVALEG 900 
EHTSALAFRL NNAWQSCPHN HVTGIAFEDV PITSRVFFGE PGPWFNQLDM DGDKTSVFHD 960 
VDGSVSEYPG SYLTKNDNWL VRHPDCINVP DWRGAICSGC YAQMYIQAYK TSNLRMKIIK 1020 
NDFPSHPLYL EGALTRSTHY QQYQPWTLQ KGYTIHWDQT APAELAIMLI NFNKGDWIRV 1080 
GLCYPRGITF SILSDVHNRL LKQTSKTGVF VRTLQMDKVE QSYPGRSHYY WDEDSGLLFL 1140 
KLKAQNEREK FAFCSMKGCE RIKIKALIPK NAGVSDCTAT AYPKFTERAV VDVPMPKKLF 1200 
GSQLKTKDHF LEVKMESSKQ HFFHLWNDFA YIEVDGKKYP SSEDGIQWV IDGNQGRWS 1260 




I I I I I I 

CAGGAATTTG TGGCGGAGAG GGCRAATAAC TGCGGCTCTC COSGCGCCCC GATGCTCGCA 
CCATGTCGAG GCGCAAGCAG GCGAAACCCC AGCACATCAA C 
30 AGCAGCAGCC GCAGCAGCAG ACCCCGGAGT T 

CGGGGGAGCT GGGTGCTCCA GTGAACCACC CAGGGAATGA C 

AAGCCACAGT AAAGCGGCTT CGTCGGGAGG AGACGCACGT CTGTGAGAAA TGCTGTGCGG 300 

TCCTCATCAT GAATGACAGC GAGGGGCCTG TGOCTTCAGA AGACTTCTCC GGAGCTGTAC 42C 

35 TGAGCCACCA GCCCaCCAGT CCCGGCAGTA AGGRCTGTCA CAGGGAGAAT GGCGGCAGCT 480 

CAGAGGACAT GAAGGAGAAG CCGGATGOSG AGTCTGTGGT GTACCIAAAG ACAGAGACAG 540 

CCCTGCCaCC CACCCCCCaO GACATAAGCT ATTTAGCCAA AGGCaAAGTG GCCAACaCTA 600 

ATGTGACXTT GCAGGCACTA CGGGGCACCA AGGTGGCGGT GAATCAGOGG AGCGOSGATG 660 

CACTCCXTTGC CCCOGTGCCT GGTGCCAACA GCATCCCGTG GGTCCTCX3AG CAGATCITGT 720 

40 GTCTGCAGCA GCSGCAGCTA CaGCAGATCC AGCTCACCGA GCAGATCCGC ATCCAGGTGA 780 

ACATGTGGGC CTCCX»OGCC CTCCACTCAA GOXSGGCAGG GQCCGACACT CTGAAGACCT 840 

TGGGCAGCCA CATGTCTCaG OiGGTTTCrG CAGCTGTGGC TTTGCTCAGC CAGAAAGCTG 900 

GAAGCCAAGG TCTGTCTCTG GATGCXTTGA AACAAGCXXA GCTACCTCaC GCX»ACATCC 960 

CTTCTGCCAC CAGCTCCCTG TCCCCaGGGC TGGCACCXTT CACTCTGAAG COSGATGGGA 1020 

45 CCCGGGTGCT CCCGAACGTC ATGTCCCGCC TCCCGAGCGC TTTGCTTCCI CAGGCCCCGG 1080 

GCTCX3GTGOT CTTCCAGAGC CCTTTCTCCa CTGTGGCGCT AGACACATCC AAGAAAGGGA 1140 

AGC3GGAAGCC ACOGAACATC TCCGOSGTGG ATGTCAAACC CAAAGACGAG GOSGCCCTCT 1200 

ACAAGCaCAA GTGTAAGTAC TGTAGCAAGG TTTTTGGGAC TGATAGCTCC TTGCAGATCC . 1260 

ACCTCCGCTC CCACACTGGA GAGAGACCCT TCGTGTGCTC TGTCTGTGOI CATCGCTTCA 1320 

CCACCAAOGG CAACCTCAAG GTGCACTTTC ACOSACATCC CCAGGTGASG GCAAACCCCC 1380

AGCTGTTTGC CGAGTTCCAG GACAAAGTGG CGGCOGGCAA TGGCATCCCC TATGCACTCT 1440 

CTGTACCTGA CCCCATAGAT GAACCGAGTC TTTCTTTAGA CAGCaAACCT GTCCTTGTAA 1500

CCACCTCTGT AGGGCTACCT CAGAATCTTT CTTCGGGGAC TAATCCCAAG GACCTCaCGG 1560

GTGGCTCCTT GCCCGGTGAC CTGCAGCCTG GGCCTTCTCC AGAAAGTGAG GGTGGACCCA 1620

CACTCCCTGG GGTGGGACCA AACTATAATT CCCCAAGGGC TGGTGGCTTC C
GGACCCCTGA GCCAGGGTCA GAGACCCTGA A 

AGQCCACCAC TGATCCCAAC GAATGTCTCA TTTGCCACCG AGTCTTAAGC T 

CCCTCAAGAT GCATTATCGC ACCCACACCG GGGAGAGACC GTTCCAGTGT AAGATCTGTG 1860
GCCGAGCCTT TTCTACCAAA GGTAACCTGA AGACACACCT T
A GACGCAGCAT T 

3 ATGCACATGG GCGGTCAGAT TCCCAACACG CCCCTGCCAG 2040



CTTTTAACCT GCAGCGCCAG GGCAGCAGAG AAAACGGTTC CGTGGAGAGC GATGGCTTGA 2340 

CCAACGACTC ATCCTCGCTG ATGGGAGACC AGGAGTATCA GAGCCGAAGC CCAGATATCC 2400

TGGAAACCAC ATCCTTCCAG GCACTCTOCC CGGCCAATAG TCAAGCCGAA AGCATCAAGT 2460 

FGCTGGG AGCAAAGCAG AGAGCTCCGA GAACAGCCGC ACTGAGATGG 2520 

3 CAGTCTCCCT TCCACGTTTA TCCGAGCCCC GCCGACCTAT GTCAAGGTTG 2580 

AAGTTCCTGG CACATTTGTG GGACCCTCGA CATTGTdCCC AGGGATGACC CCTTTGTTAG 2640 

CAGCCCAGCC ACGCCGACAG GCCAAGCAAC ATGQCTGCAC ACQGTGTGGG AAGAACTTCT 2700 

CGTCTGCTAG CGCTCTTCAG ATCCACGAGC GGACTCACAC TGGAGAGAAG CCTTTTGTGT 2760 

GCAACATTTG TGGGCGAGCT TTTACCACCA AAGGCftACTT AAAGGTTCAC TACATGACAC 2820 

ACGGGQOGAA CAATAACTCA GCCCGCCGTG GAAGGRAGTT GGCCATaSAfl AACACCATGG 2880 

CTCTGTTAGG TACS3GACGGA AAAAGAGTCT CAGAAATCTT TCCCaAGGAA ATCCIGGCCC 2940 

CTTCRGTGAA TGTGGACCCT GTTGTGTGGA ACCAGTACAC CAGCATGCTC AATGGCGGTC 3000 

TGGCCGTGAA GACCAATGAG ATCTCTGTGA TCCAGAGTGG GGGGGTTCCT ACCCTCCCGG 3060 

TTTCCTTGGG GGCCACCTCC GTTGTGAATA ACGCCRCTGT CTCCaAGATG GATGGCTCCC 3120 

AGTCGGGTAT CaGTGCaGAT GTGGAAAAAC CAAGTGCTAC TGACGGOjrT CCCAAACACC 3180 

AGTTTCCTCA CTTCCTGGAA GAAAACAAGA TTGOSGTCAG CTAAGGGAGA ACTTGCGTGG 3240 

AAGGAGCAAT GCAGACSCAG TGAAATCICT AGAATCTGCT TTGTTTTGTA AGAACTCATC 3300 

TCCTCCTGTT TTCTTTTTCT TACTGATATG CAAATGATGT TTACTAOSTT GGTTGTGACC 3360 
ACAACCTC^G GCAAGTGCTA CAATCAOGAT TGTTGCTATG CTGCTTTGCA AAAAGTTG 






1 11 21 31 41 51 

I I I I I I 

MSRRKQAKPQ HINSBBDQGB QQPQQQTPBF RDAftPAAPAA GBLGAPVNHP G 

ATOKRLRREE THVCEKCCAE FFSISEFLEH KKNCTKNPPV LIHNDSEGPV PSEDFSGftVL 120 

SHQPTSPGSK DOIRENGGSS EDMKEKPDAE SWYLKTETA LPPTPQDISX LAKGICVAHTH 180 

VTLQALRGTK VAVNQRSADft LPAPVPGftKS IPWVIiEQILC liQQQQLQQIQ LTBQIRIQVN 240 

MWASHALHSS GAGfiDTLKTL GSHMSQQVSA AVALLSQKAG SQGLSLDALK QAKIiPHANIP 300 

SATSSLSPGL APFTLKPDGT RVLPNVMSRL PSALLPQAPG SVLFQSPFST VALDTSKKGK 360

GKPPNISAVD VKPKDEAALY KHKCKYCSKV FGTDSSLQIH LRSHTGERPF VCSVCGHRFT 420 

TKGNLKVHFH RHPQVKANPQ LFAEFQDKVA AGNGIPYALS VPDPIDEPSL SLDSKPVLVT 480

TSVGLPQNLS SGTNPKDLTG GSLPGDLQPG PSPESEGGPT LPGVGPNYNS PRAGGFQGSG 540

TPEPGSETLK LQQLVENIDK ATTDPNECLI CHRVLSCQSS LKMHVRTHTG ERPFQCKICG 600 

RAFSTKGNLK THLGVHHTNT SIKTQHSCPI CQKKFTNA™ LQQHIRMHMG GQIPNTPLPE 660

NPCDFTGSEP MTVGENGSTG AICHDDVIES IDVEEVSSQE APSSSSKVPT PLPSIHSASP 720 

TLGFAMMASL DAPGKVGPAP FNLQRQGSRE NGSVESDGLT NDSSSLMGDQ EYQSRSPDIL 780

ETTSFQALSP AHSQAESIKS KSPDAGSKAE SSEMSRTEME GRSSLPSTFI RAPPTYVKVE 840 

VPGTPVGPST LSPGMTPLLA AQPRRQAKQH GCTRCGKNFS SASALQIHER THTGEKEFVC 900 

NICGRAFTTK GNLKVHYMTH GANNNSARRG RKLAIENTHA LLCTDGKRVS EIFPKBIIiAP 960 

VI QSGGVPTLPV SIiGATSWMN ATVSKMDGSQ 1020 



Nucleic Acid Accession #: NM_000612.2
Coding sequence: 553..1095

I I I I I I

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCCGACTCC 

CTCCCCCCCT CACGCCCGCC CTCTOBCCTT CGCCGAACCA AAGTGGA

TTCTGTTTCT CTCCGTGCTG TTCTCTCCOG CTGTGCGCCT GCCOSCCTCT C 
TCTCCCCCTC GCCCTCTCTT CGGCCCCCOC CTITCACGTT CACTCTGTCT C 

CTCTGCCCCC CTCIATCCTT GATACAACAG CTGACCTCAT TTCCCGATAC CTTTTCCCCC 300 

CCGAAAAGTA au«aTCTGG CCCGCCCCAG CCCGAftGACA GCCCGTCCTC CCTGGACAAT 360 

CAGACGAATT CTCCCCCCXIC CCCCAAAAAA AARAGCCATC CCCCCGCTCT GCCCCGTCGC 420

ACRTTCGGCX: CX:CGCGACrC GGCXa«3AGCG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 480 

CCRACGCCCG CTGTTCGGTT TGCGACACGC AGCAGGGAGG TGGGOGGCAG CGTCGCXXSGC 540

TTCCAGACAC CS«iTGGGAAT CCCAATGGQG AAGTCGATGC TGGTGCTTCT CaCCTTCTTG 600 

GCCTTCX3CCI CGTGCTGCAT TGCTGCTTAC CGCCCOIGTG AGACCCTGTG OSGCXSGGGAG 660 

CTGGTGGACA CCCTCCAGTT CGTCTGTGQG GACCGOSGCT TCTACTTCaG CaCGCCCGCA 720

AGCCGTGTGA GCCGTCGCAG CCGTGGCATC GTTGAGGAGT GCTGTTTCCG CaGCTGTGAC 780 

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCXKCCAAGT CCX3AGAGGGA CXSTOTCXSACC 840 

CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCaVAGTT CTTCCaATAT 900 

GACACCTGGA AGCAGTCCAC CCAGCGCCIG CGCAGGGGCC TGCCIGCCCT CCTGOSTGCC 960 

CGCCGGGGTC ACGTGCTCGC CAAGGAGCTC GAGGCGTTCA GGGAGGCC3Ui. ACGTCaCCGT 1020

CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GOSCCCCCXC AGAGATGGCC 1080 

AGCAATCGQA AGTGAGCAAA ACTGCCGCRA OTCTGCaGCC CGGOSCCACC ATCCTGCAGC 1140 

CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCCG AAARTCTCTC GGTTCCACGT 1200 

CCCCCTGGGG CTTCTCCTGA CCCRGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTCGGCCCC CTCCATCGGG CTGAGGAAGC ACAGCAGCAT CTTCAAACAT GTACAAAATC 1320
GATTGGCTTT AAACACCCTT CRCATACCTT CCCXCC 

Seq ID NO: 99 Protein sequence
Protein Accession #: NP_000603.1

I I I I I I

MGIPMGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGF YPSRPASRVS 60 
RRSRGIVEEC CFRSCDLALL ETYCATPAKS ERDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 
QSTQRLRRGL PALLRARRGH VLAKELBAFR EAKEHRPLIA LPTQDPAHGG APPEMASHRK 



GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

\CTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

C CCCAGCGAGT CCTCCGGAAA GAGCCTGTCa CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCAC3UITTG ATGACTTTGA GATTGGGCGT 300

CCTCTGGGCA AAGGCAAGTT TGGTUiACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCQTGQCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGOGCAGAG AGATCGAAAT CCAGGCCCAC CTGCaCX^TC COW-CATCCT GCX3TCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC COGOSGGGAG 540

CTCTACAAGG AGCTGCSGAA GAGCTGCXCA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG C»GATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTOGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACS«iTCfrGTG GCACCCIGGA CTACCTGOCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGACTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCOGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTOGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

TCAGCCCACC CTTGQGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGOCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGOSIG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTIGT 1200 
TTMTAAflGO CT6AAGCTTT T



I I I I I I 

MAQKiMSyPW PyGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMBN 
SSGTPDILTR HFTIDDFBIG RPLGKGKFGN VYLRREKKSH FIVALKVLFK SQIEKEGVEH 
QLRREIEIQA HI.HHENILRL YNYFYDRRRI YLILEYAPRG ELYKELQK8C TFDEQRTATI 
MEELADALMY CHGKKVIHRD IKPEKLLLGL KGELKIADPG WSVHAPSLRR KTMCGTLDYL 
FPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KFPASVPTGA 
QDLISKLLRH NESERLPLAQ VSAHPWVRAN SRRVLPP5AL QSVA 






Seq ID NO: 102 DNA sequence
Coding sequence: 



C ACAGGTTCAC 

CTTTTCCCAG ATCTTTGGGC CAGAAGTGGG aCaGGCATCC TTCTTCAACC 
\ AAGGATGTAC TCAAAGGGCA GAACTGGCTC ATCTATACAT 

TTCTCCCCCG 

GTCCCTGGCG CTGATCTTCA ATAGCCTCCA AGGCCAACTT CATCCAACAC 
GCCCTTGCTC TCCAATGAGG TAATCTGGCT AGACAGCAAG CAGATCCGAC 

CTTCCTTGAA 

C TACATCGAAA GICGGATAGG TACCAGCSVCC AGCTTCGACA GTGGCATTGC 
TQGGCTCTCT TCTATCAGTC AGTGTACCAG CAGTAGCCAG CTGGATGAAA CAAGTCATCG 
ATGGGCACAG CCAGACACTG CCCCACTACC TGTCCCGGCA AACATTCGCT TCTCCATCTG 
GRTCTCRTTC TTTGRGATCT ACAACGAACT GCTTTATGAC CTATTAGAAC OGCCTAGCCA 
ACAGOSCAAG AGGC3W3ACIT TGCGGCTATG CGAGGATCAA AATGGCaVATC CCTATGTGAA 
AGATCTCAAC TGGATTCATG TG(»AGATGC TGAGGAGGCC TGGAAGCTCX: TARAAGTGGG 
TOSTAAGAAC CAGAGCTTTG CCAGCaCCCA CCTCAACCAG AACTCCAGCC GCftGTCACAG 
CATCTTCTCA ATCAGGATCC TACACCTTCA GGGGGAAGGA GATATflGTCX: CCAAGATCAG 



CGAGCTGTCA CTCTGTGATC TGGCTGGCTC AGAGCGCTGC RAAGATCAGA AGAGTGGTGA
AOGGTTGAAG GAAGCAGGAA ACATTAACAC CTCICTACAC ACCCTGGGCC GCTGTATTGC 
TGCCX:TTCGT CAAAACCAGC AGAACXXBTC AAAGCAGAAC CTGGTTCCCr TCCGTGACAG 
CAAGTTGACT OGAGTGTTCC AAGGTTTCTT CACaGGCKGA GGCCCTTCCT GCATGATTGT 
CAATGTGAAT CCCTGTGCAT CTACCTATGA TGAAACTCTT CATGTGGCCA AGTTCTCAGC 
CATTGCTAGC CAGGTGAOTT GTGCATGCXX: CACXTTATGCA ACTGGGATTC CCaTCCCIGC
ACTCGTTCAT CAAGGAACAT AGTCTTCAGG TATCCCCCAG CTTAGAGAAA GGGGCTAAGG 
CAGACACAGG CCTTGATGAT GATATTGAAA ATGAAGCTGA CATCTCCATG TATGGCAAAG 
AGGAGCTCCT ACAAGTTGTG GAAGCCATGR AGACACTGCT TTTGAAGGAA CX3ACAGGAAA 
AGCTACAGCT GGAGATGCAT CTCCGAGATG AAATTTGCAA TGAGATGGTA GAACAGATGC 
r TGGACACCCA AAAGGAACTA TTGGAGGAAA 
T ATCCTCAAGG AGTCACTGAC AAGTTTTTAC CAAGAAGRGA 

0 TAGAAGCTCT CTTGCAGGAA GCCAGACAAC 2040 
3 AATTGGCCCT ACGGOGGTCA CAAAGGTTGG 2100 
C CTCCRCCCAG CAGCTTCAGG AGGTTAAAGC TAAATTACAG CAGTGCAAAG 2160 
A CTCTACCACT GAAGAGTTGC ATAAGTATCA GAAAATGTTA GAACCACCAC 2220 
CCTCAGCCAA GCCCTTCACC ATTGATGTGG ACAAGAAGTT AGAAGAGGGC CAGAAGAATA 2280 
C TCTCCAATCA GCAGAGAGAG 2340 



.140 



AAGGCCAGGT TTCTGCCAAA AAGCGCCTTG G 

3AAGAAA CCATTCCTTC GAAATTTACT TCCCCGAACA CCAJlCCTGCC 
AGACTGCRGC CCTTATGCCC GGATCCTACG CTCACGGCGT TCCCCTTTAC 

GGCCCTGAGG TGGGTCAGCT ACTCTCCTGA AGAAATAGGT CTCTTTTATG CTTTACCATA 
TATCAGGAAT TATATCCAGG ATGCAATACI CAGACACTAG 
TATAACCACC TATGTAATCT CATGTTGTTG TTTTTTTTTA 



I I I I I I 
MSQGILSPPA GliSDDDVW SPMFBSTAAD LGSWRKNLL SDCSWSTSL EDKQQVPSED 
SMEKUKVYLR VRPLLPSELE RQEDQGCVRI ENVETLVLQA PKDSFALKSH ERGIGQATHR 
FTFSQIFGPE VGQASFFKLT VKEMVKDVLK GQNWLIYTYG VTNSGKTHTI QGTIiOXSGIL 
PRSLALIFNS LQGQLHPTPD LKPLLSNEVI WLDSKQIRQE EHKKLSLLNG GLQEEELSTS 
LKRSVYIESR IGTSTSEDSG lAGLSSISQC TSSSQLDETS HRWAQPDTAP LEVPANIRFS 
IWISFFEIYN ELLYDLLEPP SQQRKRQTLR LCEDQNGHPY VKDLNWIHVQ DAEEAHKLLK 
VGRKNCSFAS THLNQNSSRS HSIFSIRILH LQGEGDIVPK ISELSLCDLA GSERCKDQKS 
GERLKEAGNI NTSLHTLGRC lAALRQNQQN RSKQNLVPFR DSKLTRVFCX3 FFTGRGRSCM 
IVJJVNPCSIST YDETLHVAKP SAIASQVTCA CPTYATGIPI P 

Seq ID NO: 104 DNA Sequence 
I I I I I I

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGC3GCC T 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA T 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAfiGCCACC GACAACGATG ACATCTAIGG 

GGCTGCCTGG ATCG3CATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG T 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC T 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG A 



AGAACCTCTC ftACCTGGAGG CTTGTAAACT AGGCGTGCCT GCTTTTTATC ACARTCAGGG 
CTGCTATGAA CTGATCTCTG 6TCCAATGAA CCGACAOSCC TGGGGGGTTG CCTGGTTTGG 
ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 
AATTGAATAT TAAGAA 



MAKDNSrVEC PQGLLIPGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DHDDIYGAAW 60 

IGIFVGICLF C3.SVLGIVGI MKSSRKILLA YFILMFIVYA FBVASCITAA TQRDFFTENL 120 

FLKQMLERYQ MNSPENNDDQ WKKHGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTEHN 180 

DADYFWFRQC CVMNNLKEPL HLEACKLGVP GFYHNQGCYE LlSGFmSHR WQVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 

Seq ID NO: 106 DNA sequence 

Nucleic Acid Accession #: NM_002740.1

Coding sequence: 178..1968

I I I I I I

CCGCGGTTCC GGCTGCTCCG GCGAGGCGAC CCTTGGGTCG GCGCTGCGGG CGAGGTGGGC 60 

AGGTAGGTGG GOSGRCGGCC GCGGTTCTCC GGCAAGCGCA GGCGGCGGAG TCCCCCRCGG 120 

CGCCCGAAGC GCCCCCCGCA CTCCCGGCCI CCaGCGTTGA GGCGGGGGRG TGAGGAGATG 180 

CCGACCCAGA GGGACAGCAG CftCCATGTCX: CACACGGTCG CAGGCGGCGG CAGCGGGGAC 240 

CATTCCCACC AGGTCCGGGT GAAAGCCTAC TACCGCGGGG ATATCATGAT AACACATTTT 300 

GAACCTTCCA TCTCCTTTGA GGGCCTTTGC AATGAGGTTC GAGACATGIG TTCTTTTGAC 360 

AACGAACAGC TCTTCACCAT GAAATGGATA GATGAGGAAG GAGACCCGTG TACAGTATCA 420 

TCTCAGTTGG AGTTAGAACA AGCCTTTAGA CTTTATGAGC TAAACAAGGA TTCTGAACTC 480 

TTGATTCATG TGTTCCCTTG TGTACCAGAA CGTCCTGGGA TGCCTTGTCC AGGAGAAGAT 540

AAATCCATCT ACCGTAGAGG TGCACGCCGC TGGAGAAAGC TTTATTGTGC CAATGGCCAC 600 

ACTTTCCAAG CCAAGCGTTT CAACACCCGT CCTCACTGTG CCATCTGCAC AGACCGAATA 660 

TGGGGACTTG GACGCCAAGG ATATAAGTGC ATCAACTGCA AACTCTTGGT TCATAAGAAG 720 

r TGCCACAGGA ACCAGTGATG 780 

A CAGTAATTCC ATATAATCCT 840 

TCAAGTCATG AGA3TTTGGA TCAAGTTGGT GAAGAAAAAG AGGCAATGAA CACCAGGGAA 900 



r ACTGTTGGTT CGATTAAAAA AAACAGATCG TATTTATGCA 1020 

ATGAAAGTTG TGAAAAAAGA GCTTGTTAAT GATGATGAGG ATATTGATTG GGTACAGACA 1080

GAGAAGCATG TGTTTGAGCA GGCATCCAAT CATCCTTTCC TTGTTGGGCT GCATTCTTGC 1140 

TTTCAGACAG AAAGCAGATT GTTCTTTGTT ATAGAGTATG TAAATGGAGG AGflCCTAATG 1200 

TTTCATATGC AGCGACAAAG AAAACTTCCT GAAGAACATG CCAGATTTTA CTCTGCAGAA 1260 

ATCAGTCTAG CATTAAATTA TCTTCATGAG CGAGGGATAA TTTATAGAGA TTTGAAACTG 1320 

GACAATGTAT TACIGGACTC TGAAGGCCAC ATTAAACTCR CTGACTACGG CATGTGTAAG 1380 

GAAGGATTAC GGCCAGGAGA TACAACCAGC ACTTTCIGTG GTACTCCTAA TTACATTGCT 1440 

CCTGAAATTT TAAGAGGAGA AGATTATGGT TTCAGTGTTG ACTGGTGGGC TCTTGGAGTG 1500 
CTCATGTTTG AGATGATGGC AGGAAGGTCT CCATTTGATA TTGTTGGGAG C 
CCTGACCAGA ACACAGAGGA TTATCTCTTC CAAGTTATTT T 

CtACGTTCTC TGTCTGTRAA AGCTGCAAGT GTTCTGAAGA GTTTTCTTAA TAAGGACCCT 1680 

AAGGAACGAT TGGGTTGTCSl TCCTCAAACA GGATTTGCTG ATATTCAGGG ACACCEGTTC 1740 

TTCOGAAATG TTGATTGGGA TATGATGGAG CAAAAACAGG TGGTACCTCC CITTAAACCA 1800 

AATATTTCTG GGGAATTTGG TTTGGACAAC TTTGATTCTC ACTTTACTAA TGAACCTGTC 1860 

CAGCTCACTC CAGATGACGA TGACATTGIG AGGAAGATTG ATCaGTCTGA ATTTGAAGGT 1920 

TTTGAGTATA TCAATCCTCT TTTGATGTCT GCAGAAGAAT GTGTCTGATC CTCATTTTTC 1980 

AACCATGTAT TCIACTCATG TTGCCATTTA ATGCATGGAT AAACTTGCTG CAAGCCTGGA 2040 

TACAATTAAC CATTTTATAT TTGCCACCTA CJUlAAAAAOl CCCAATATCT TCTCTTGTAG 2100 

ACTATATGAA TCSiATTATTA CATCTGTTTT ACTATGAAAA AAAAATTAAT ACTACTAGCT 2160 

TCCAGACAAT CATGTCRAAA TTTAGTTGAA CTGGTTTTTC AGTTTTTAAA AGGCCTACAG 2220 
ATGAGTAATG AAGTTACCTT TTTTGTTTAA A 



I I I I I I 

MSHTVAGGGS GDHSHQVEVK AYYRGDIMIT HFEPSISPBG LCNEVRDMCS FDK 
WIDEEGDPCT VSSQLELEEA PRLYELNKDS ELLIHVPPCV PERPGMPCPG EDKSIYRRGA 
RRWRKLYCAN GHTFQAKRFN RRAHCAICTD RIWGLGRQGY KCINCKLLVH KKCHKLVTIE 
CGRHSIiPQEP VMPMDQSSMH SDHAQTVIPY NPSSHESLDQ VGEEKEAMHT RESGKASSSL 
GLQDFDLLRV IGRGSYAKVL LVRLKKTDRI YAMKVVKKEL VNDDEDIDWV QTEKHVFEQA 
SNHPFLVGLH SCFQTESRLF FVIEYVNGGD LMFHMQRQRK LPEEHARFYS AEISLAUJYL 
HERGIIYBDL KLDHVLLDSE GHIKLTDYGM CKEGLRPGDT TSTFCGTPNY lAPEILHGED 
L GVLMFEMMAG RSPEDIVGSS DHPDQHTEDY LEQVILEKQI RIPRSLSVKA 



Coding sequence: 127..934



GAAACAATGC TGCTAGCGAC ATTCAAGCTG TGOSCTGGGA- GCTCCTACM ACRCATGCGC 
AACATGAAGQ GGCTGftGGCR ACRGGCTGTG ATGGCCATCA GCCSGGAGCI GAACOGGRGG 
GCCCTGGGGG GCXXICACCCC TAGCAOSTGG ATTAACCSGG TTOGGCGGCG GAGCTCTCTA 
CTCGGTTCTC GGCTGGAAGA GACTCTCTAC AGTGACa«3G AGCTGGCCTA TCTCCAGCAG 
GGGGAGGftGG CCATGCAGAR GGCCTTGGGC ATCCTTAGCA ACX»SGAGGQ CTGGASGAAG 
GAGSGTC3«3C AGGACAATGG GGACAAAGTG ATGAGTAAAG TGGTCXCAGA TGTGGGC3UIG 
GTGTTCCGGC TGGAGGTCGT GGTGGACCAG CCCATGGAGA GGCTCTATGA AGAGCTCX3TG 
GAGCGCATGG AAGC3UVTGGG GGAGTGGAAC CCCAATGTCA AGGAGATCAA GGTCCTGCAG 
AAGATCGGAA AAGATACATT CRTTACTCAC GAGCTGGCTG CCGAGGCAGC AGGAAACKTG 
GTGGGGCCCC GTGACTTTGT GAGCGTGCGC TGTGCCAAGC GCCGAGGCTC CACCTGTGTG 
CTGGCTGGCA TGGACACAGA C 



GGAGTCCCAC 



CACTGATTAG CTATGACATG AGGGTTCAGG CCCCTAAAAT 
GGGCCCTTAT GTACCCACCT ARAACCATCT TTAAAATGCT 
GGGATGCTAA CCACAGGGCC TGAGAfiGTCT TGCTTTATGG 
GGCAGTACAT GTCCACAAAC CAGAATCTCA GAGGGTCTCC 
GGCCGCTGCA CAGCAACACC ACAGAACAAG CAGCACCCCA 



MliATFKLCA 



RLEVWDQPM ERLYEELVEE MEAMGEMNPN VKEIKVLQKI GKDTFITHEL AAEAAGNLVG 
FSDFVSVRCA KSBGSTCVLA GHDTDFGHMF EQKGVIRAEH GPTCMVLHPL AGSPSKIKLT 
WLLSIDLKGW LFKSIINQVL SQTQVDFANH LRKHLESHFA SEARC 

Seq ID NO: 110 DNA sequence

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 131-682 

1 11 21 31 41 51 

I I I I I I 

GCTGGGRGCC TGGGCCGGGA GCCGGGTGRG GGCGCOGAGA GGCTCGGTGG GCGCGGGCGG 
CGAGATATGC CACRCTTCTG CCTGCTGTTG GCAACOCTCC TGGACTAGGC TGCTCTTOTT 
AATCACATGG ATGTTATATA AGAGTTCGGA COGCCCAGCA CACRAGGTCA GCATGCTGCT 
CCTCTGTCAC GCTCTCGCTA TAGCTGTTGT CCAGATCX3TT ATCTTCTCAG AAAGCTGGGC 

AAATAGCTTC AAGTGCTTTA CTTGTGAAAA CGCAGGGGAT AATTATAACT GCAATCGATG 
GGCAGAAGAC AAATGGTGTC CACAAAATAC ACAGTACTGT TTGACAG 
CAGCCACGGA AGAAGCACAT CCATCACCAA AAAGTGTGCC TCCftGAAGTG A 

h ACATAOGGAG TGTAGGTCTT GCTGTGAAGG 
r TACCCACCRA TCACACTAAT GCAGTGTTTG CCGTAATGCA 



CTTGCACTTG 



CAGAGGCTGC CAGGTCAAAC CCTCTTGTTT ATGTGATTAG CTCAGAGCAT CTCTATGAAA 1080 

TCTAACCCTT CCOCTCATGA GAAAGCAGTT TTCCCCACCA ACAGCATSGT CRATGRGAAA 1140 

GGCAACTGTA CGRAGAAAAC TTCCAGTGGA ACTAATATGA AATCTATTTG CAAATTATGG 1200 
GGGGAAATAA AGCTTTTAAA TTATACAATG T 

Protein Accession #: AAM20908.1

1 11 21 31 41 51 

I I I I I I 

MLYKSSDRPA HKVSMLLLCH flIAIAWQIV IFSESWAPAK NDIPlfllVRPP LDPTPPPNSP 60 
KCFTCENAGD NYNCHRWAED KWCPQNTQYC IiTVHHFTSHG RSTSITKKCA SRSECHFVGC 120 
HHSHDSEHTE CRSCCEGMIC NVELPTflHTH AVFAVMHAQR Tl 
PLL 

Seq ID NO: 112 DNA sequence
Nucleic Acid Accession #: Eos sequence
Coding sequence: 228-884






I I I I I I 

CGCCCGCCGG CCCCAGGCGG GTGCGCTGGG AGCCTGGGCC GGGAGCCGGG TGAGGGCGCX: 
GAGAGGCTCG GTGGGCGCGG GCGGCGAGGR CTCTGCTGGA GCAGGACTTC AGAGTGTTTG 
TTTTCAGCCT GCTTTTAAAG TGATTTGAAG AGAGOGGCTT TGAAGATATG CCACRCTTCT 
GCCTGCTGTT GG(»ACCCTC CTGGACIAGG CTGCTCTTGT TAATCACATG GATGTTGCTG 
ATTACTCTGA GTGCAAACCT TTTCACTGTT CC&GAGAGGA GCCTGACAAC CACATTCTCC 
TTCTCAAGGT GTGCTGCTTA CTGCGCaGGC TGACCAGATA TAAGAGTTCG GACOSCCCAG 
CACACAAGGT CaWSCATGCTG CTCCTCTGTC ACGCTCTCGC TATAGCIGTT GTCCAGATCG 
TTATCTTCTC AGAAAGCTGG GCATTTGCCA AGAACATCAA CTTCTATAAT GTGAGGCCTC 
CTCTCGACCC TACACCATTT CCAAATAGCT TCAAGTGCTT TACTTGTGAA AACGCAGGGG 
ATAATTATAA CTGCAATCGA TQGGCAGAAG ACAAATGGTG TCCACAAAAT ACACSGTACT 
GTTTGACACrr TCATCACTTC ACCAGCCACQ GAAGAAGCAC ATCCATCACC AAAAAGTGTG 
CCTCCAGAAG TGAATGTCAT TTTGTOSGTT GCCACCACAG COaGATTCT GAAC31TACGG 
AGTGTAGGTC TTGCTGTGAA GGAATGATCT GCAATGTAGA ATTACCCACC AATCACACTA 
ATGCAGTGTT TGCOGTAATG CACGCTCAGA GAACATCTGG CRGCAGIGCC CCCACACTCT 
ACCTACCAGT GCTTGCCTGG GTCTTTGTGC TTCCATTGCT GTGATGCCAC CATTCCTAGG 
AGAGGCAGAG ACCAGCCTCT AAAGCACAAG CCAAAAACTG TGTGAACGGT GAACTTTGGA 
GTGAAGATCA ATCTTGCACT TGGTGAAGAG TGCACATTGG ACCTCAAGGC G 
GGTTTGCTTG GATAAAATGT TCCCGCRTGA GGCCACAGGA CTGAGGATGG G 
GGGCCTGAGA AGATGGTCTG ACTTCCAGGC TTCCTGGTCA AAGAGAGCTA CGTTTGGGCA 1140 
GTTCTGCAGA GAGGATCCTG GCAACTAGTC CCACCTGACT AGGCCTTTAG CTGAAAGGAT 1200 
TTCTTGACCT CCTTGRCTGC CTCAGAGGCT GCCAGGTCAA ACCCTCTTGT TTATGTGATT 1260 
AGCTCAGAGC ATCTCTATGA AATCTAACCC TTCCCCTCAT GAGAAAGCAG TTTTCCCCAC 1320 
CAACAGCS^TA GTCAATGAGA AAGGCAACTG TACGAAGAAA ACTTCCAGTG GAACTAATAT 1380 



P EPNSFKCFTC ENAGDNYNCN RWAEDKWCPQ 
NTQYCLTVHH FTSHGRSTSI TKKCASRSEC HFVGCHHSRD SEHTBCRSCC EGMIOTVELP 
TNHTNAVFAV MHAQRTSGSS APTLYLPVIA WVFVLPLL 

Seq ID NO: 114 DNA sequence

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 402-1025 

1 11 21 31 41 51

I I I I I I 

ACTTCCTGAS CCGGGCTGGC TGGGTGGGAA CAGGCTCCTT GCCGCCTCCC CAGCX3CTQGC 
CACTACCACA CTGOXKCCG CCTGGGCCTC CTTTCSVACCT CGTGGTGGAG CCCTGOSGTT 
TCCCAGCXXa GCCGGGCCCG GGGCTGCTCC CTCGCGGGCG AGGCTCACCT GTCCOGGCCC 
GGCCrCCTCC CX3CGCX:ca«S GTGGTTCAGG GCAGGGAGGA GCCGCGCCCX: GCCCCGCGCX3 
GTAGCAGCCA AOGCCGGCCC CaGGCGGGTG CGCTGGGAGC CTGGGCCGGG AGCCGGGTGA 
6GGCGCCGAG AGGCTCGGTG GGCGCGGGCG GCGAGATATG (XACACTTCT GCCTGCTGTT 
GGCAACCCTC CTGGACTAGG CTGCTCTTGT TAATCACATG GATGTTGCTG ATTACTCTGA 
GTGCAAACCT TTTCACTGTT CCAGAGAGGA GCCTGACAAC CACATTCTCC TTCTCAAGAT 
ATAAGAGTTC GGACCGCCCA GCACACAAGG TCAGCATGCT GCTCCTCTGT CACGCTCTCG 
CTATAGCTGT TGTCCAGATC GTTATCTTCT CAGAAAGCTG GGCATTTGCC AAGAACATCA 
ACTTCTATAA TGTGftGGCCT CCTCTCGACC CTACACCRTT TCCAAATAGC TTCAAGTGCT 
TTACTTGTGA AAACGCAGGG GATAATTATA ACTGCAATCG ATGGQCAGAA GACAAATGGT 



GCAGCAGTGC CCCCACACTC TACCTACCAG TGCITGCCTG GGTCTTTGTG C7TCCATTGC 1020 

TGTGATGCCA CCATTCCTAG GAGAGGCAGA GACCAGCCTC TAAAGCACAA GCCAAAAACT 1080 

GTGTGAACGG TGAACTTTOG AGTGAAGATC AATCTTGCAC TTGGTGAAGA GTGCACATTG 1140 

GACCTCAAGG CGAAAGCCAG TGGTTTGCTT GGATAAAATG TTCCCGCATG AGGCCACAGG 1200

ACTGAGGATG GGAATTTGGC AGGGCCTGAG AAGATGGICT GACTTCCAGG CTTCCTGGTC 1260

AAAGAGAGCT ACGTTTGGGC AGTTCTGCAG AGRGGATCCT GGCAACTAGI CCCACCTGAC 1320 

TAGGCCTTTA GCTGAAAGGA TTTCTTGAOC TCCTTGACTG CCTCAGAGGC TGCCAGGTCA 1380 

G TTTATGTGAT TAGCTCAGAG CATCTCTATG AAATCTAACC CTTCCCCTCA 1440 

GTTTTCCCCA CCAACAGCAT AGTCAATGAG AAAGGCAACT GTACGASGAA 1500

A TGAAATCTAT TTGCAAATTA TGGGGGGAAA TAAAGCTTTT 1560



MLLITLSANL FTVPERSLTT TFSFSRYKSS DRPAHKVSML LLCHALAIAV VQIVIFSESM 
AFAKNINFYN VRPPLDPTPF PNSFKCPTCE HAGUUlraCNR WAEDKWCPQN TQYCLTVHHF 
TSHGRSTSIT KKCASRSECH FVGCHHSHDS KHTECRSCCE GMICMVELPT MHTNAVFAVM 
HAQRTSGSSA PTLYLPVLAW VFVLP 

Seq ID NO: 116 DNA sequence

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1-1059 
TCATCTGCTG A6W3AGGTGA GCiAAGCTGCR GAAGAAGACT TTGAAflCTftG CRGAGCTG<3G 180

TTCATGRGGA TGARGGARAG ARGCCAT6CC CATAACRTAA RAGTGCAAGG TGfiAGCAGGA 240 

GGTGCTGATG AAGAftGCTGC AGCAAGTGAT CCACiftAGftTC TAGCTflAGAT CATTGATGAA 300 

GATGTGAAAG GCATTGTACA GAAGAAGATG AGAATCACAG TAAACCAAAC AAAGGAAATG 360 

CAGAAGAGAT ATTTTCAGCT GAATATTGGA AATGTAAAAT GCCACACGTT GGATGAGATG 420 

ATTATCTGAG CTCAGACITG GGAAATAGTC ATGCTTCTGG ACCAAGTGCX: AGGTCCIGGC 480 

CCTACRCAAC A(3GTAGATGG TTGGAtATAT TTGGATAATG GAGCTTTCCA AACAGAAGCA 540

AAACACRAAC CCACTGATGT AAAATATAAA GAAACTGAAC CAGTGTGTCT TTTCACCATA 600 

GATATAAGAG TTCGGACXSC CCAGCACACA AGAATAAAAC AGAAAGTCTC CATTACTTCT 660 

ATGGCIACAC CATTTCCAAA TAGCTTCAAG TGCTTTACTT GTGAAAACGC AGGGGATAAT 720 

TATAACTGCA ATCX3ATGGGC AGAAGACAAA TGGTGTCCAC AAAATACACA GTACTGTTTG 780 

ACAGTTCATC ACTTCACCAG CCACGGAAGA AGCACATCCA TCACCftAASA GTGTGCCTCC 840 

AGAAGIGAAT GTCATTTTGT CGGTTGCrAC CACAGCXX3AG ATTCTGAACA TACBGAGTGT 900 

AGGTCITGCT GTGAAGGAAT GATCTGCRAT GTAGAATTAC CCACCAATCA CACTAATGCA 960 

GTGTTTGCCG TAATGCACGC TCAGAGAACA TCTGGCT^GCA GTGCCCCCaC ACTCTACCTA 1020 

CCAGTGCTTG CCTGGGTCTT TGTGCTTCCA TTGCTGTGAT GCCRCCATTC CTAGGAGAGG 1080 

CRGAGACCAG CCTCTAAAGC ACAAGCCRAA AACTGTGTGA ACGGTGAACT TIGGAGTGAA 1140 
GATCAATCTT GCACTTGGTG AAGAGTGCAC ATTGGACCTC A 
A CAGGACTGAG G 

avAAGAG AGCTACGTTT GGGCAGTTCT 
R TCCTGGCAAC TAGTCCCACC TGACTAGGCC T 



C CTCATGAGAA AGCAGTTTTC CCCACCAACA 

GCATAGTCAA T 



FMRMKBRSHA HHIKVQGEAG GADBEAAASD PEDLAKIIDE DVKGIVQKKM RITVNQTKEM 
QKRYP3LNIG NVKCHTLDEM IIRAQTWEIV MLLDQVPGPG PTQQVDGWIY LDNGAEQTEA 

-JVCLFTI DIRVRTAQHT RIKQKVSITS MATPFPNSFK CFTCENAGDN 

K WCEQNTQYCli TVHHFTSHGR STSITKKCAS RSECHFVGCH HSRDSEHTBC 
?TNHTNA VFAVMHAQRT SGSSAPTLYL PVLAWVFVLP IiL 

Seq ID NO: 118 DNA sequence

Nucleic Acid Accession #: XM_038659.6

Coding sequence: 528-1688 

1 11 21 31 41 51

I I I I I I 

AGTAGGGAGG TGGGCAGGAG CCAGTGAIGA CGGAAIGGCA ATC3VCATITG ACCTCTGATC 
TGTTTATTTC CTCCTCXITTG AOGTCTCCAT ATAAATGTTA CACGGGCATC CCCACSCTCG 
GATACGCACC CaCAGTGGCT GATTCGGGGG TAACCGTGTC ATTTGCTTGC AACACTGGCA 
CCTCTGCCCT GCACCCCGGG AGTGAGC3«3T GAGTGAGGCT CGGGTCTGGG CX3CTGGCTCX: 
GAATCTTCGG GCTGGQAGAG ACTCCACCAT CTGGGQGCGG CCTGGGGGAG CAGCCTTAGT 
GTCTTCCTGC TGATGCAATC CGCTAGGKX3 OSAGTCTCCG CCGCGAGAGG GCCGGTCTGC 
AATCCAGCCC GCCACGTGTA CTCGCCGCOG CCTCGGGCAC TGCCCCAGGT CTTGCTGCAG 
CCGGQACCGC GCTCTGCAGC CGCAGACOOG GTCCACACGG CCAGGGGCTA CGACCCTTGG 
GATCTGCCCT CCGCTCAGCT CGAGCTTCOC TCGTGGCCGA CGGAACAATG AAGGATTGCA 
GTAACGGATG CTCCGCAGAG TGTACCQGAG AAGGAGGATC AAAAGAGGTG GTGGGGACTT 
TTAAGGCTAA AGACCTAATA GTCACACCAG CTACCATTTI AAAGGAAAAA CCAGACCCCa 
ATAATCTGGT TTTTGGAACT GTGTTCACGG ATCATATGCT GAOGGTGGAG TGGTCCTCAG 
AGTTTGGATG GGAGAAACCT CATATCAAGC CTCTTCAGAA CCTGTCATTG CflCCCTGGCT 
CATCAGCTTT GCACTATGCA GTCGAATTAT TTGAAGGATT GAAGGCATTT CGAGGAGTAG 
ATAATAAAAT TCGACTGTTT CAGCCAAACC TCAACATGGA TAGAATGTAT CGCTCTGCTG 



GTCCCATATT CAACATCTGC TAGTCTGTAT ATTCGTCCTA 
TCTCTTGGAG TCAAGAAGCC TACCAAAGCC CTGCTCTTTG 
CCTTATTTTT CAAGTGGAAC CTTTAATCCA GTGTCCCTGT 
GGGCCAATCC CAAGTATGTA AGAGCCTGGA AAGGTGGAAC TGQGGACTGC AAGATGGGAG 
GGAATTACGG CTCATCTCTT TTTGCCCfiST GTGAAGCAGT AGATAATGGG TGTCAGCAGG 
TCCIGTGGCT CTATGGAGAG GACCATCAGA TCACTGAA6T GGGAACTATG AATCTTTTTC 
TTTACTGGAT AAATGAAGAT GGAGAAGRAG AACTGGCAAC TCCTCCACTA GATGGCATCA 
G AGTGACAAGG CGGTGCATTC TGGACCTGGC A 



TGAGAGAGAT CTTTGGCTCT GGTACAGCCT GTGTTGTTTG CCCAGTTTCT G 

ACAAAGGOSA GACAATACAC ATTCCAACTA TGGAGAATGG TCCTAAGCTG GCAAGCCGCA 1620 

TCTTGAGCAA ATTAACTGAT ATCCAGTATG GAAGAGAAGA GAGCGACTGG ACAATTGTGC 1680 

TATCCTGAAT GGAAAATAGA GGATACAATG GAAAATAGAG GATACCAACT GTATGCTACT 1740 

GGQACAGACT CJITGCATTTG AATTGTGATA GATTTCTTTG GCTACCTGTG CATAATGTAG 1800 

TTTGTAGTAT CAATGTGTTA CRAGAGTGAT TGTTTCTTCA TGCCAGAGAA AATGAATTGC 1860 

AATCATCBAA TGGTGTTTCA TAACTTGGTA GTAGTAACTT ACCTTACCTT ACCTAGAAAA 1920

ACATTAATGT AAGCCATATA AC3VTGGGATT TTCCTCAATG ATTTTAGTGC CTCCTTTTGT 1980 

ACTTCACrCA GATACTAAAT AGIAfiTTTAT TCTTTAATAT AAGTTACATT CTGCTCCTCA 2040 

AACaAATGCA ATTTTTTGTG TSTCTTTGAA AGCTAATTTG AGAAAATTTC ATAGGTTACA 2100 

TTTCCTGCAG CCTATCTTTA TCCACAGAAA GTGTTTTCTT TTTTTTAAAT CAAGACTTTT 2160 

AAAACTGGAT TTCCTCCC^T CACTGTTTTT TGAAGGTCCT CCAAGTCCGT GTTAAGGTAA 2220 

ATATCTGTTT TCTTCCTGAT GTCACAGCOT GAGCATACTC TGTGCATTAG GAAGACCTGA 2280 

GTGCATTTCC CACCATTGTC CTTTCCACAT TATGTTCTAG CIGGCTGGCT GTCAGGOGAC 2340 

TACAAGACTG AGGGTCTTGT GCCTTATAGA TCTTTGTATC CCCCATGGCT GACATATAGT 2400 

AGGTACTOIG TAAATGGTTT TATAATGAAT CAGTGAACAT TTTGCTTCTA TAGAAGTGTA 2460 
CCTTCTTTGT TTCTATATTA TGAAACCTCT TTATTAGAAT TTG^ " " 






CTGAAGCATA GTTSGTTTAG GAAAICACTT CATATTGATT GTATTAfiAAT TATCTTGGAA 2640 

T ATCCCTAGAG C3«3(3GGACCC CAACCCCCA6 GCCATGGGCC ACACAGCAGQ 2700 

G TGCTGGGOa TTGAGGAfiCT TCATCTGTAT TTATGGCTAC TTCCCMCAC 2760 

CGAACTC CACCTCTTGT CAGCTC3«3TG GCAGCATTAG ATTCTCATAG 2820 

r CCTATT6TGA ACTCTGCATG CAAGQGATCT AGGCTATGOG CTCCTTATGA 2880 

GAATCTAATG CTTGATGRCC TGAGGTGTAA CAGTTTC3VTC CTGAAACX3VC CCITCACCCT 2940 

GC3«;TCTGTG GAAAAATTGT CTTCCACAAA ACTGGTCCCT GGTGCCAAAA ATGTTGGGGA 3000 

CCRCTGCTCT AGAGAGAGGT CATGATATCSV TACCAACCAA ATGGAAATGA CAAATGTTTT 3060 

ATGTCAAGTG TTAATTGCAG AAATAAATCT TTTTTTTTTT TTTTTGGTAG AAAACAAAGA 3120 

GGCATACrCT GATTTTTATA CTTCTGTTTTT GCRGGTGCTC TTTTCTTTGA ATGGAGATTT 3180 

GATGAGCAAG TGGTTAGGAT GCAGGGAGAG CTACTATGGG TGATATTTTC CTTGTTTAGG 3240 

AGCTGTGAGT TAAAATTGTA TCCTTTGTGG TTTATCTAAG GAAAGTCAAA TCTTGACaGA 3300 

AAACATTTTT CCTTGGAAC3G TCAACTCTCa GACATTGTAT TTTGGTTTCC CTC»GTCCTC 3360 

ATAACTTCCT TCTTGCTGAA C3VTATTTTAT TCaCTTTTCA GAGAAGGAAA ATAAAAAGGA 3420 

TTCTAAAAGT TTGATGCATT GGAAAAATTT CCTTGAGGCA TTTAGCAACA CATAGAAAAT 3480 

GGGCTTTGAT TCTTTTCCAA AACTTTTAGC CATAGGCSTCT TTTATAGACA GGGATAGTAA 3540 

AATGAAAATT GAGAAATATA AGATGAAAAG GAATGATAAA AATATCTTTT AGGGGQCTTT 3600

TAATTGGTGA TCTGAAATCT TGGGAGAAGC TGTTCTTTTC AGGCCTGAGG TGCTCTTGAC 3660 



ACGCTTTCTT GTTAAATAGA TTTAACAGGA ACATCTGCAC ATCTTTTTTC CTTGTGCACT



TTAARATGAA GTGTCftAGAC 



TCATTTATTA AGGATTCAAC TCGTGATACA CC3UIAAGAAG AGTTGACTTC AGAGATGTGT 4680 

TCCATGCTCT CTAGCACAGG AATGRATAAA TTTATAACAC CTGCTTTAGC CITTGITTTC 4740 

AAAAGCACAA AGGAAAAGTG AAAGGGAAAG AGAAACASGT GACIGAGABG TCTTGTTAAG 4800 

GAATCAGGTT TTTTCTACCT GGTAAACATT CTCTATTCTT TTCTCXAAAG ATTGCTGTAA 4860 
GAAAAAATGT AAGAC 



I I I I I I 

MKDCSNGCSA ECIGEGGSKE WGTFKAKDL IVTPATILKE KPDPMHLVFG TVFTDHMLTV 
EWSSEFGWEK PHIKPLCJNLS LHPGSSALHY AVELFEGLKA FRGVDNKIHL PQPNLNMDHM 
XRSAVRATLP VFDKEELLEC IQQLVKLDQE WVPYSTSASL YIRPTFIGTE PSLGVKKPTK 
ALLFVLLSPV GPYFSSGTFN PVSIMANPKY VRAWKGGTGD CKMGGNYGSS LFAQCBAVDN 
GCQQVLWLYG EDHQITEVGT MKLFLYWINE DGEEELATPP LDGIILPGVT RRCILDLAHQ 
HGEFKVSERY LIHDDLTTAL EGHRVREMFG SGTACWCPV SDILYKGETI HIPTMEHGPK 
LASRILSKLT DIQYGREESD WTIVLS 



Seq ID NO: 120 DNA sequence
Nucleic Acid Accession #: KM_
Coding s 



ACAGAGGGCG G 

CCCAGCTGGA G3GGAACTAG TCTGCTCCAG GTGGCAAGCT GCGTGAGCAA GCAAGCCAAC 
ATGGACCGCG ACTCGTACCA T 

CGCTCCACGA CGCCCAGCGA GGACATCTGG AAGAAATTCG AGTTGGTGCC G 




GCCACCCCCG ACTACACTCC CGAGCTCGAA GCCGGCAACC TAGCGCCCAT C 
TTGTTCGGCG AGCCCAAGAT CCAGGCCIGC TCCAGGTCTG AGAGCCCAAG CGACTCCGAG 
GGTGAAGAAA TCGACGTGAC AGTAAAGAAG AGGCAGTCTT TGftGTRCGCG GAAGCCAGTC 
ATCATCGCGG TGOSTGCAGA CCTTCTGGAT CCCCGCATGA ATCTCTTCCA CATCTCCATC 
A TGCTGCCCCT TTTCCICCAG AAAGCTGCTT CCAAGAAGGG 



GATAAGGAAG ATGAAGAGAT TGTGAGOCTC CCACCTGTAG AAAGTGAGGC TGCCCAGTCC 



TACCTGGAGC GCAAGAGACG GAATGATCAA CGTTOGOGGT TCTTGGCCCT GAGGGACGAG 1020 

GTACCCGCCC TGGCCAGCTG CTCTAGQQTT TCCAAAGTAA TGATCCTAGT CAAGGCCACG 1080 

GAATACTTAC ATGAACTGGC GGAAGCOGAG GAGAGGATGG CTACGGAGAA AAGGCSGCIC 1140 

GAATGCCAGC GAOGGCAATT GCAGAAAAGA ATTGAGTACC TOVGTAGCTA CTGACCAAAA 1200

AGCCTGACCA TTCTGTCTTA AAAAGACACA AGTTTTCTTT TTGATCICCC TCTCCCCTTT 1260 

AGTAACTTGT ACATTTTTGT TACRGCAGGA CACTCIGGAC AGIAGATTGC AfiAATCGATT 1320 

GCAGCCAGTG CACAAACAAT ATAAAGGCTT GCATTCTTGG AAACTTIGAA ACCCAGOTCT 1380 

CTCTCTTCCC TGACTTATGG GAGTGCTTTG IGTTTTCTGG CACCTTTGGC TTCTCaGCAG 1440 

GCAGCTGACT GAGGAGACTT GGGGTCTTCC TGGCTCACTA TCTCCAAAGA AAAGGCTGAC 1500 

AGATGGTATG CAACAGGTGG TGGATGTTGT TGGGGGCTCC AGCCTGGAGG AAATCTCACA 1560

CTCTACATGA ACTTTAGQCT AGGAAAGGAT GTCTCTGGGG TGATGCAAGG ACAGCTGGGT 1620 

GTGGACGCTC TCCTGCGGCT CCATTTTTTT OCAGGAGACR CACAAGCTGC CTTGG6TGAA 1680 






AACftAGCrCSV GAGRCTTGAT CflAOSTGSAC CRTTACCTCA CTGTCAGRCA CTACAGCTAG 1740 

CTGAGGAGTT GGAAACCTTA CATATATGTA TATATATATG TATGTATATA TGTATATATG 1800 

TATATATATA TGTATGTATA TATGTATATA TGTATATATA TATGTATGTA TATATGTATA 1860 

TATGTATATA TATATGTATG TATATATGTA TATATTATGR TGTTGGCTGA CCCCCTTCCT 1920 

CCCACTCTCSi ATGCTGTGAC TCAGAACATT TAAGAGAACT TCGTCTGTAA GTAATTTGTC 1980 

TTAAAGCCCT CTGG6CTCTC TTCTCTGAGT GAGGGAACTT TCTGTCTTCA CAAGGGACTT 2040 

TGTCTCATTC TGCCTCTGTT ATGCAATGGG TTCTACAGCA CCCTTTCCCG CAGGTTAGAA 2100 

ATATTTCCCT AAGACACAGG GAAATGGGTC TTAGCCTGGG GCCTGGGCAA AGTTCCCAAG 2160

CCCTGGCTCa TGAACTCAAT CCCTGCCCAG GTGTTTTCTG AGGGGCCCTT GAGGCCAATC 2220 

ITTTCTCAAG ACSWTGTGAG GCACCTTAGA AGGGAGAAOT GTAACACTTT CTCTTTCGCA 2280 

CCTGCCrCTC ATCrCAATCC TTGACTGATG AATTTGAAGT TCIACTAGAA CCATGAAAAC 2340 

TTGTICCTTT CGIGCAICTC CAAGGAGCTT GCTGGCTCTG CAGCCAOGCT TGGGCCCTOS 2400 

CACCAGCCTG CAATGAATCA GATCTCTGTC ACAGAATCTG GGCCTCTCTG AAGTTTTCTG 2460 

GAGAGCTGTT GGGACTCATC CAGTGCTCCA CAAOGTGGAC TTGCCTCCTG GTGTGTTTTA 2520 

AAGGATCCTC CAGGAGOTCT GCTTAGCC3UI TCATOVTGAT GGATTTTTTT TTTTITTTIT 2580 

GAGACGGAGT CTCAACTCTT GTCGCCCAGG CTGGAGGTTA ATGGCATGAT CTCGGCTCSVC 2640 

TGCAACCTCT GCCTCCCGGG TTCAAGCGAT TCTCCIGCCI GAGCCTTCCG SfiTAGCTGGG 2700 

ATCGCAGGCG CCTGCCACTA aSCCTAGCTA ATTTCIGTAT TTITAGTAGA GATGGGGTTT 2760 

CACCACATTG GCCAGGCTGG TCTTGACCTC CTGACCTAGG TGATCCACTG CCTCCATGAT 2820 

AGATTITGCC CCSGCTGGAC TCTGCAGCIC C3VCGTGGAAT CCAGGTGCCT GCCTCC»GTC 2880 
TGGGAAAGTC ACCAACCXX3C AGCTTGTC3VT GTGGGTAACT TCTGAACCCT AAGCC 



I I I I I I

MDRDSYHHYF YDYDGGEDFY HSTTPSEDIW KKFELVPPPW TWVRSREPSP QLWSPGTWPV 
GCAGDETESQ DYWKAWDANY ASLIRRDCMW SGFSTQEPLE RAVSDMAVG APSGYSPKEF 
ATPDYTPELE AQILAPIFPC LLGEPKIQAC SRSESPSDSB GEBIDVTVKK RQSLSTRKPV 
IIAVRADLLD PRMNLFHISI HQQQHNYAAP FPPESCFQEG APKRMPPKEA LESEAPGGKD 
DKEDEEIVSL PPVESEAAQS CQPKPIHYDT E 



r EYLHELAEAE ERMATEKHQL ECQHRQLQKH lEYLSSY 



Coding sequence: 356..4750

I I I I I I

GAGGTTTGGG AGGCGCGGGA GATGTCCACC CTGGGCTGGT GGOGCCGCOG GGCGCCGGGC 60 

GCCATGAGGG TGCX3CTAGGC GGCTGTTCGT GCCCGAGGCT GCGCAGCACT GAGGTGAGCT 120 

TTGCCTTCTT GATCTTCCGT CCTTCTTGGA GACGACTGGC GAGAGGAAGA GGGACTAGGT 180 

CCAAACX3CTA GGTGGCTGGG TCCAGCCGGA GACCCGCACC AAGGAGGAGA TCATCGAGCT 240 

CTTGGTCCTT GAGCAGTACC TGACCATCAT CCCTGAAAAG CTCAAGCCTT GGGTGCGAGC 300 

AAAAAAGCCG GAGAACTGTG AGAAGCTCGT CACTCTGCTG GAGAATTACA AGGAGATGTA 360 

CCAACCAGAA GACGACAACA ACAGTGACGT GACCAGCGAC GACGACATGA CCCGQAACAG 420 

AAGAGAGTCC TCACCACCTC ACTCAGTCCA TTCTTTCAGT GGTGACCGGG ACTGGGACOS 480 

GAGGGGCAGA AGCAGAGACA TGGAGCCACG AGACCGCTGG TCCCACACCA GGAACCCAAG 540 

AAGCAGGATG CCTCCGQGGG ATOTITCCOT TCCTGTGGTG GCGAAAACAA GCTTIGAAAT 600 

GGACAGAGAG GACGACSGGG ACTCCAGGGC TTATGAGTCC CGATCTCRGG ATGCTGAATC 660 

ATACCRAAAT GTGGTGGACC TCGCIGAGGA C3VGGAAACCT CACAACACAA TCCAGGACAA 720 

CATGGAAAAC TACAGGAAGC TGCTCTCCCT CGGAGTGCAG CTTGCTGAAG SCGATGGCCA 780 

CTCCCACATG AOGCAGOGCC ACTCATCAAG ATCCAAGAGA AGTGCCTACC CAAGCACCAG 840 

TCGAGGTCTA AAAACTATGC CTGAAGCCAA AAAATCAACC CACCGQCGGG GGATTTGTGA 900 

AGATGAATCT TCCCACGGAG TGATAATGGA AAAATTCATC AAGGATGTGT CACGCSGTTC 960 

CAAATCGGGA AGAGCAAGGG AGTCAAGCGA CCGGTCACAG AGATTCCCCA GAATGTCAGA 1020 

TGATAACTGG AAGGAC3VTTT CATTGAACAA GAGQGAGTCA GTGATCCAGC AGCGGGTTTA 1080 
TGAAGGGAAT GCATTTAGGG GAGGCTTTAG GTTTAATTCA ACCCTTGTTT CCAGAAAGAG 1140

AGTTCITGAA AGAAAGAGGC GCTATCATTT TGACACAGAT GGGAAGGGCT CGATTCACGA 1200 

TCAAAAAGGC TGTCCCAGGA AGAAGCCCTT TGAATGTGGT AGTGAGATGA GAAAAGCCAT 1260 

GAGCHTGAGC AGCCTGAGCA GCCTCAGCTC CCCCTCCTTT ACOSAGTCAC AGCCAATTGA 1320 

A ATGCCATATG TATGTGATGA GTGTGGGAGG TCGTTCAGTG TCATCTCAGA 1380 

3 CACCAGATCA TGCATACTAG AGAGAACCTC TATGAGTATG GTGAGTCCTT 1440 

T GTGGCTGTCA GTGAAGTTCA GAAAAGTCAG GTTGGAGGGA AACGTTTTGA 1500

ATGTRAGGAC TGTGGAGAGA CCTTCAATAA GAGTGCCGCC TTGGCTGAAC ATOGGAAGAT 1560 

TCATGCTAGA GGTTATCTTG TGGAATGTAA GAATCAGGAA TGTGAGGAAG CCTTCATGCC 1620 

TAGCCCCACC TTTAGTGAGC TTCaGAAAAT ATATGGCAAA GACAAATTCT ACGAGTGCAG 1680 



TGGGGATGAC AAAGATAATG AGCGTGAACA TGAAOGTGAA CGTGAACGTG AGCGCGGGGA 
AACCTTTAGG CCCAGCCCAG CCCTTAATGA GTTTCAGAAA ATGTATG 
GTACGTJ^TGT AAGGTGTGTG GGGAGACTTT CCTTCATAGC TCATCCCTGA A 
GAAAATCCAT ACTAGAGGGA ACCCATTTGA AAACAAGGGT AAAGTGTGTG AGGAAACCTT 



T GGCCGGGATG CCTTCATGCA A 
TCATTCTCGA AAGAACCTCT TTGAAGGCAG AGGGTATGAG AAATCTGTCA TT^TAGTOG 2160 
GCCATTCACT GAATCTCAGA AGAGTCATAC TATAACAAGA CCTCTTGAAA G 



C GAGGCAAAAT CATATGAGAG GTCTGTTATT CATAGCTTAG CCTCTGTGGA 2340 
AGCTCAGAAA AGTCACAGTG TAGCAGGGCC C 
CATTCAGAGC TTCGATGCTA TCAACCS 
AGGAAGGGAA TACAGTAGGT CTGTTATCCA TAGCTTAGTG GCTTCCAAAC CTCCAAGAAG 2520



AGACCTTAAT GATAAGCGAC AGAAGATTCC TGCCAGAGAG AACCCTTGTG AAGGGGGCAG 2640 

TAAGAATCGC AACTATGAAG ACTCTGTCAT ACAGAGTGTA TTCCGTGCCA AACCTCAGAA 2700 

AAGTGTTCCT GGAGAQGGAT CTGGTGAGTT TAAGAAGGAT GGCGAATTCT CTGTTCCCAG 2760 

CTCAAATGTC OGTGAATACC AGAAGGCTCG TGCTAAAAAG AAATACATTG AGCATAGGAG 2820 






CARTGAGACC TCTGTAATTC ACTCTCTGCC TTTTGGTGAA CflAACATTTC GCCCTCGRGG 2880 

GATGCTCTAT GAATGTCAGG AGTGTGGGGA GTGCTTTGCT CATAGCTCTG ACCTCACIGA 2940 

GCftOCAGAAG ATTCATGATA GGGAGAAGCC CTCTGGRAGC SGAAACTATG AATGGTOTGT 3000 

CATTCX3CAGC TTGGCCCCTA CTGACCCTCA AACTIAGTTAC GCCC3UU3AGC AfiTATGCTAA 3060 

AGAGCAAGCG OGGAACAAAT GTAAGQACTT CAGACAATTT TTTGCTACCA GOSAAGACCT 3120 

CAACACAAAC CAGAAAATCT ATGACCAAGA GAAGTCICAT GGCGAGGAGT CTCAAGGCGA 3180 

GAATACTGAT GGGGAGGAGA CCCACAGCGR GGAGACCCAT GGTCAGGAGA CAATTGAAGA 3240 

CCCTGTCATT CAAGGCTC3VG ACATGGAAGA CCCTCAGAAG GATGACCCTG ATGACAAAAT 3300 

CTATGAATGT GAGGACTGTG GCCTGGGCTT TGTGGATCTC ACAGACCTCR CAGACCSTCA 3360 

GAAAGTCCAC AGCAGGAAGT GCCTGGTTGA CAGTCGGGAG TACACACATT CTGTAATTCA 3420 

C3VCCCATTCC ATCAGCGAGT ATCSWaGAGA TTACAOXSGA GAGC3«3CTGT ATGAATOICC 3480 

AAAGTGIGGG GAATCTTTTA TTCATAGCTC ATTCCTTTTC GAGCaTCAGA GAATCCATGA 3540 

ACAAGACCAG TTCTATTCCA TGAAGGGGTG TGATGATGCT TTTATTGCCC TCrTGCCCAT 3600 

GRAGCCACGG AGGAATCGTG CTGCRGAGAG GAATCCTGCT CTTGCTGGGT OGGCCATTCG 3660

ATGCCTTTTG TGTGGACAAG GCTTCATTCA TAGCTCTGCC CTTAATGAGC ATATGAGACT 3720 

TCATAGGGAA GATGATTTAC TGGAGCAGAG CCRGATGGCT GAGGAAGCTA TCATTCCAGG 3780 

CTTAGCCCTC ACTGAGTTTC AGAGAAGTCA GACCGAAGAG AGACTCTTTG AATGTGCAGT 3840 

CTGTGGAGAA TCTTTCGTCA ACCCAGCAGA ACTTGCAGAT CACGTAACTG TTCATAAGAA 3900 



TCCTTTATTC ATAGCACAGT 



GATTCAGGGC TTAViCGTAG AGGCTGCTGA GCCAGAAGTG GAGGCTGCCG AGCCAGAAGT 
GGAGGCTGCT GAGCCAGAAG TGGAGGCTGC TGAGCCAAAC GGAGAGGCTG AAGGGCCAGA 

AAATGGGGAT GCTGATGAGC CAGATGGTGC XGGTATTGAA GAGCCAGAAG A 



G GTAGAAGAAC CATACTATGA 
A TGCACAGAAA CCTTCACTTC CAGCACAGCA TTCAGTGAAC ACCTGAAAAC 
TCATGCCAGC ATGATCATAT TTGAGCCTGC AAATGCCTTT GGGGAGTGCT C 



TGACGTCTGT GGGCAGCTCT TCAATGACCG CCTGTCCCTC GCCAGACACC AGAATACCCA 4740

CACTGGCTGA GGGCATGGGG TAAAGGTTAG AAAAOCTTCA CCTAGGACTT GACCCTTACC 4800 

AAACCACAGA GAATCCAAAC CAATCCATGA TAATGTC3«;T AGGAGACTTA ACCTTAGTGT 4860 

GITACACACC TQACITAACA TCTCTAAACI CAGATTGAAA AGAGACCGAA TGIGCAGATT 4920 

CCACAGTCTT AAGCITTCCC CITCAGATGT CAGTGTCIGC ATGTGGGAAA GCCATAGCAC 4980 

ACATCTIACC TTTCCAAGTA AICAGATTGA GAAAACCCTA ICSAGTATTCC AGACTACAGA 5040 

GTTTGCCCAA ATCAACTGTA AATGACACTT GTGTAAOGTA TATATAGTGT TTCATGAGQT 5100 

GTATATAAAA TAGCAAATTA TGACAGAACA GTGATCACAT ATATTTGGAT TTATATGATA 5160 

TACAGTTACA GTTTACTCTG CAGAGGTACC TTACCTGGTA TTCTTTGAAT TTTTTTTTTT 5220 

TTTGGAGGAG GAAGRSAGCA ACAAATTTGA TTATATTTTT AAGTGTCTTA GATCCTGRGA 5280 

AAGATTTATT GTGCATTATT TGAACCCTGT CAATATCTTT TTGAGTAATT GTTTTGTTTC 5340 

TIACCCTTAA ATAGTCTTGT GAAGCTGTAG GOVTGATAGA TAACATGGCT TTTACTCXST 5400 

ACTGTTTGAA AAGATAAGTA CTTTAGCTTC TTTCTGCaGC CATTTCATCT GCACCAACAC 5460 

TTTGGAACCr AATACTGTGT AAGGCTTTAC AATATADGGA TTGGCTTTTT GTGACCC34GA 5520 

TTGATTGGTT GCCACaTGTT ATGTTTGTTG AAGTGGTTCT CATGCAAAAA TATTACACMIT 5580

TTGTGTTCTG GGTTTTTTTT TTTTTTTAAC CAACTCAATA TGTGTTTGAT GATAGTGAAT 5640 

TGATAAAACC OGAAGCTTTT CCCTGTAAAT CTTACATCTT TGCCTTTAAA GAATGGOITA 5700 

CAACCATCAC TAGATCACAG TAGTGCCTAA TGAAGGTTGA GRACCGTAGG AGftGGCTCTC 5760 

ATGCTGTAAA TAATGTTGCA GGCTAATAAC CTTTCATCAC TTCCTTTGTG CGCTTCCTGC 5820 

CTTAAGTGAC AAGTAGCAAC ATGGCTTGGG TCCCCTGTGC AGCATCAGCT TATGCTGCCA 5880 

CAAGTCAGTT TGCACCCTAG GTGCCCAGGA GCTAGTATCC TTAGATCTTT CTATCGCTAA 5940 
CTTAATTCTC TTCGTTATTT ATCTGACCCT CTAACTCCAT GTCTAACTTG CATT 

Seq ID NO: 123 Protein sequence



VQTLGGWVQP ETRTKEEIIE LLVLEQYLTI IPEKLKPWVR AKKPENCEKL VTLLEHYKEM 



RSRMPPRDLS LPWAKTSFE MDREDDRDSR AYESRSQDAE SYQNWDLAE DRKPHNTIQD 
NMENYKKLLS LGVQLAEDDG HSHMTQGHSS R 
EDESSHGVIM EKFIKDVSRS SKSGRAHESS D 

VEGMAFRGGF RFNSTLVSRK RVLERKERYH FDTDGKGSIH DQKGCPRKKP FECGSEMRKA 
MSVSSLSSLS SPSFTESQPI DFGAMPYVCD ECGRSFSVIS EFVEHQIMHT RENLYEYGES 
FIHSVAVSEV QKSQVGGKRF ECKDCGETFN KSAALAEHRK IHARGYLVBC KNQECEEAFH 

PSPTFSELQK lYGKDKFYEC RVCKETFLHS SALIEHQKIH FGDDKDNERE H 

ETFRPSPALN EFQKMYGKEK MYECKVCGETT FLHSSSLKEH QKIHTRGHPF E 
FIPGQSLKRR QKTVNKEKLC DFTDGRDAFM QSSELSEHQK IHSRKNLFEG R 
GPFTESQKSH TITRPLESDE DBKAFTISSN PYBNQKEPTK EHVYEAKSYE RSVIHSLASV 720 
EAQKSHSVAG PSKPKVMAES TIQSFDAINH QRVRAGGNTS BGREYSRSVI HSLVASKPPR 780 
SHHGNELVES NEKGBSSIYI SDIiNDKROKI PARENPCEGG SKNRNYEDSV IQSVFRAKPQ 840 
KSVPGEGSGE FKfCDGEFSVP SSHVREYQKA RAKKKYIEHR SHETSVIHSL PFGEQTEEPR 900 
GMLYECQECG ECETffiSSDLT EHQKIHDREK PSGSRNYEWS VIRSLAPTDP QTSYAQBOYA 960 
KEQASNKCKD FSQFFATSED LNTHQKIYDQ EKSHGEESQG BNTDGEETHS EETHGQETIE 1020 
DPVIQGSDME DPQKDDPDDK lYECEDCGLG FVDLTULTDH QKVHSRKCLV DEEEYTHSVI 1080 
HTHSISEYQR DYTGEQLYEC PKCGESFIHS SFLFEHQEIH EQDQLYSMKC CEDGFIALLP 1140 
MKPRRNRAAE RHPALAGSAI RCLLCGQOFI HSSAUIEHHR UHREDDLLEQ SQMAEEAIIP 1200 
GLALTEFQRS QTEBRLFECA VCGESFVNPA ELADHVTVHK NEPYEYGSSY THTSFLTEPL 1260 
KGAIPFYECaC DCGKSFIHST VLTKHKELHL EEEEEDEAAA AAAAAAQEVE ANVHVPQWL 1320 
RIQGLHVEAA EPEVEAAEPE VEAAEPEVEA AEPNGEAEGP DGEAAEPIGE AGQPHGBAEQ 1380 
PNGDADEPDG AGIBDPBERA EEPEGKAEEP BGDADEPDGV GIEDPEBGBD QEIQVEEPYY 1440 
DCHECTETFT SSTAFSBHLK THASMIIPEP ANAFGECSGY lERASTSTGG ANQADEKYFK 1500 
CDVOGQLFND RLSLARHQNT HTG

Seq ID NO: 124 DNA sequence
Nucleic Acid Accession #: NM_007196
Coding sequence: 180.. 1962

1 11 21 31 41 51 

I I I I I I 

GTTCCCAGAA GCTCCCCAGG CTCTAGTGCA GGAGGAGAAG QAGGAQGAGC AGGACX3TGGA 
GATTCCCaCT TAAAAGGCTC CftGAATCGTG TACCMGQCAG AGftACIGAAG TACTGGGGCC 
TCCTCCaCTG GGTCCGAATC ASIAGGTGAC CCCGCCOCTG GRTTCTGGAA GACCTCACCA 
TGGGACGCCC COGACCTCGT GCGGCCAAGA CGTGGATGTT CSTrGCTCTTG CTGGGGGGAG 
CCTGGGCAGG ACACTCCAGG GCACAGGAGG ACAAGGTGCT GGGGGGTCAT GAGTGCCAAC
CCCATTCGCA GCCTTGGCAG GCGGCCTTGT TCCAGGGCCA GCAACTACTC TGTGGCGGTG 
TCCTTGTAGG TGGCAACTGG GTCCTTACAG CTGCCCACTG TAAAAAACCG AAATACftCAG 
TACGCCTGGG AGACCACAGC CTACAGAATA AAGATGGCCC AGAGCAAGAA ATACCTGTGG 
r CCCACACCCC TGCTACRACA GCAGCGATGT GGAGGACCAC AACCATGATC 



3 CCTGGCCAGA AGTGCACCGT CTCAGGCTGG G 
CCAGTCCCCG AGAGAATTTT CCTGACACTC TCAACTGTGC AGAAGTAAAA ATCTTTCCCC 
AGAAGAAGTG TGAGGATGCT TACCCGGGGC AGATCACAGA TCGCATGGTC TGTGCAGGCA 
GCAGCAAAGG GGCTGACACG TGCCAGGGCG ATTCTGGAGG CCCCCTGGTG TGTGATGGTG 
CACTCCAGGG CATCACATCC TGGGGCTCAG ACCCCTGTGG GAGGTCCGAC AARCCTGGCG 
TCTATACCAA CATCTGCCGC TACCTGGACT GGATCAAGAA GATCATAGGC AGCAAGC3GCT 
GATTCTAGGA TAAGCACTAG ATCTCCCTTA ATAAACTCAC AACTCTC 

Protein Accession #: NP_009127

I l' r I 1^ 1^ 

MGRPRPEAAK TWMFLLLLGG AWAGHSEAQE DKVLGGHECQ PHSQPWQAAL FQGQQLLCGG 
VLVGGNWVLT AAHCKKPICTT VRLGDHSLQN KDGPBQEIPV VQSIPHPCYN SSDVEDHNHD 
LMLLQLRDQA SLGSKVKPIS LADHCTQPQQ KCIVSGHGTV TSPSENFPDT LHCAEVKIFP 
QKKCBDAYPG QITDGMVCSG SSKGADTCQG DSGGPLVCDG ALQGITSWGS DPCGRSDKPG 
VYTNICHYLD WIKKIIGSKG 

Seq ID NO: 126 DNA sequence
Nucleic Acid Accession #: NM_014791.1
Coding sequence: 171.. 2126

1 11 21 31 41 51

TTGGCGGGCG GAAGCGGCCA CAACCCGGCG ATCGAAAAGA TTCTTAGGRA CGCCGTACCA 
GCCGCGTCTC TCAGGACftfiC AGGCCCCTGT CCTTCTGTCG GGCGCCGCTC AGCCGTGCCC 
TCCGCCCCTC AGGTTCTTTT TCTAATTCCA AATAAACTTG CAAGRGGACT ATGAAftGRTT 
ATGATGAACT TCTCAAATAT lATGAATTAC ATGAAACTAT TGGGACAGGT GGCTTTGCAA 
AGGTCAAACT TGCCTGCCAT ATCCTTACTG GAGAGATGGT AGCTATAAAA ATCATGGATA 
AAAACACACT AGGGAGTGAT TTGCCCCGGR TCSAAACGGA GATTGAGGCC TTGAAGAACC 
TGAGACATCA GCATATATGT CAACTCTACC ATGTGCTAGA GACAGCCAAC AAflATATTCA 
TGGTTCTTGA GTACTGCCCT GGAGGAGAGC TGTTTGACTA TATAAT7TCC CAGGATOGCC 
TGTCAGAAGA GGAGACCCGG GTTGTCTTCC GTCAGATAGT ATCTGCTGTT GCTTATGTGC 
ACAGCCAGGG CTATGCTCAC AGGGACCTCA AGCCAGAAAA TTTGCTGTTT GATGAATATC 
ATAAATTAAA GCTGATTGAC TTTGGTCTCT GTGCAAAACC CAAGGGTAAC A"VGGATTACC 
ATCTACAGAC ATGCTGTGGG AGTCTGGCIT ATGCAGCACC TGAGTTAATA CAAGGCAAAT 
CATATCTTGG ATCAGAGGCA GATGTTTGGA GCATGGGCAT ACTGTTATAT GTTCTTATGT 
GTGGATTTCT ACCATTTGAT GATGATAATG TAATGGCTTT ATACAAGAAG ATTATGAGAG 
GAAAATATGA TGTTCCCAAG TGGCTCTCTC CCAGTAGCAT TCTGCTTCIT CAACAAATGC 
TGCAGGTGGA CCCRAAGAAA CGGATTTCTA TGAAAAATCT ATTGAACCA 
TGCAAGATTA CAACTATCCT GTTGAGTGGC AAAGCAAGAA TCCTTTTAT 
ATGATTGCGT AACAGAACTT TCTGTACATC ACAGAAACAA CAGGCAAACA ATGGAGGATT 1080 
TAATTTCACT GTGGCAGTAT GATCACCTCA CGGCTRCCTA TCTTCTGCTT CTAGCCAAGA 1140 
AGGCICGGGG AAAACCAGTT CGTTTAAGGC tTTCTTCTTT CTCCTGTGGA CAAGCCAGTG 1200 
CTACCCCATT CACAGACATC AAGTCAAATA ATTGGAGTCT GGAAGATGTG ACCGCAAGTG 1260 

GTGCTGCTAC TCCCCGAACA TCACAGTTTA CCAAGTACTG GACAGAATCA AATGGGGTGG 1380 

AATCTAAATC ATTAACTCCA GCCTTATGCA GAACACCTGC AAATAAATTA AAGAACAAAG 1440 

AAAATGTATA TACTCCTAAG TCTGCTGTAA AGAATGAAGR GTACTTTATG TTTCCTGAGC 1500 

CAAAGACTCC AGTTAATAAG AACCAGCATA AGAGAGAAAT ACTCACTACG CCAAATCGTT 1560 

ACACIACACC CTCAAAAGCT AGAAACCAGT GCCTGAAAGA AACTCCAATT AAAATACCAG 1620 

TAAATTCAAC AGGAACAGAC AAGTTAATGA CAGGTGTCAT TAGCCCTGAG AGGCGGTGCC 1680 

GCTCAGTGGA ATTGGATCTC AACCAAGCAC ATATGGAGGA GACTCCAAAA AGAAAGGGAG 1740 

CCAAAGTGIT TGGGAGCCTT GAAAGGGGGT TGGATAAGGT TATCACTGTG CTCACCAGGA 1800 

GCAAAAGGAA GGGTTCTGCC AGAGACQGGC CCAGAAGACI AAAGCTTCAC lATAATGTGA 1860 

CTACAACTSfi ATTAGTGAAT CCAGATCAAC TGTTGBATGA AAIAATGTCT ATTCTTCCAA 1920 

AGAAGCATGT TGACTTTGTA CAASAGGGTT ATACACTGAA GTGTCAARCA CAfiTCAGATT 1980 

TTGGGAAAGT GACAATGCAA TTTGAATTAG AAGTSIGCCA GCITCAAAAA CCCGATGTGG 2040 

TGGGTATCAG GAGGCAGCGG CTTAAGGGOG ATGCCTGGGT TTACAAAAGA TTAGTGGAAG 2100 

ACATCCTATC TAGCTGCAAG GTATAATTGA TGGATTCTTC CATCCTGCCG GATGAGTGTG 2160 

GGTGTGATAC AGCCTACATA AAGRCTGTTA TGATOGCTTT GATTTTAAAG TTCATTGGAA 2220 

CTACCAACTT GTTTCTAAAG AGCTATCTTA AGACCAATAT CTCTTTGTTT TTAAACAARA 2280 

A TCTAAATCAA GCCCATCTGT CATTATGTTA CTGTCTTTTT 2340 

a TTAATAATTG TTGACTTTCT TAGATTCACT TCCATATGIG 2400 

AATGTAAGCT CTTAACTATG TCTCTTTGTA ATGTGTAATT TCTTTCTGAA ATAAAACCAT 2460 
TTGTGAATAT

AYVHSQGXflH RDLKPEHLLP DEXHKLKLID FGLCSKPKGM KDKHLQTCCC3 S
QGKSYLGeBA DVWSMGILLY VLMC6FLPFD DDNVM6LYKK IMRGKYDVPK WLSPSSILLL 
QQMLQVDPKK RISMKNLLNH PWIMQDWYP VEHQSKNPFI HLDDDCVTEL SVHHEIWRQT 
MEDLISLWQY DHLTATYLLL LAKKAEGKPV ELRLSSFSCG CJASATPFTDI KSNNWSLEDV 
TASDKHYVAG LIDYDWCEDD LSTGAATPRT SQFIKYWTES N6VESKSLTP ALCRTPANKL 
KNKENVrrPK SAVKNBEYFM FPEPKTPVNK NQHKREILTT PNEYTTPSKA EHQCLKETPI 
KIPVNSTGTD KLMTGVISPE RHCRSVBLDL NQAHMEETPK RKGAKVFGSL ERGLDKVITV 
LTRSKRKGSA RDGPRRLKLH YNVTTTRLVN PDQtLNEIMS ILPKKHVDFV QKGYTLKOQT 
QSDFGKVTMQ FELEVCQLQK PDWGIREQR L 

Seq ID NO: 128 DNA sequence 



I I I I 

T CTGGAATGGA GGTCTTATGA GCTGCTATTG A" ' 

3GAGCCC TCCAGTCAGT ACTGATTGAA TTACTCAAGG CTGCCTCTCT 
A GCACTACAGG ACGTCGGGAC TGGGCATTTC CTTCCAACAT GGCCGCCACT 
GCCTCTCCGC AGCCACTCGC CACTGAGGAT GCCGATTCTG AGAATAGCAG CTTCTATTAC 
TATGACTACC TGGATGAAGT GGCCTTCATG CTCTGCAGGA AGGATGCAGT GGTGTCCTTT 
GGCAAAGTCT TCCTCCCAGT CTTCTATAGC CTGATTTTTG TGTTGGGCCT CAGCGGGAAC 
CTCCTTCTTC TCATGGTCTT GCTCCGTTAC GTGCCTCGCA GGCGGATGGT TGAGATCIAT 
CTGCTGAATC TGGCCATCTC CAACCTTCTG TTTCTGGTGA CACTGCCCTT CTGGGGCATC 
TCCGTGGCCT GGCATTGGGT CTTCGGGAGT TTCTTGTGCA AGATGGTGAG CACTCTTTAT 



C CTACCACAGG CTGAGGACCC GGGCCAAGAG CCTGCTCCTT 660

GCTACCATAG TATGGGCTGT GTCCCTGGCC GTCTCCATCC CTGATATGGT CTTTGTACAG 720 

ACACATGAAA ATCCCAAGGG TGTGTGGAAC TGCCACGCAG ATTTOGGOGG GCATGGGftCC 780

ATTTGGAAGC TCTTCCTCCG CTTCCAGCAG AACCTCCTAG GGTTTCTCCT TCCACTCCTT 840 

GCCATGATCT TCTTCTACTC CCGTATTGGT TGTGTCTTGG TGAGGCTGAG GCCCGCAGGC 900 

CAGGGCCGGG CTTTAAAAAT AGCTGCAGCC TTGGTGGTGG CCTTCTTOST GCTATGGTTC 960 

CCATACAATC TCAOCTTGTT TCTGCRTACG CTGTTGGACC TGCASGTATT CGGGAACTGT 1020 

GAGGTCAGCC AGCATCTAGA CTACGCACTC CAGQTAACSG AG&GCATCGC CTTCCTTCAC 1080

TGCTGCTTTT CCCCCATCCT GTATGCCTTC TCCAGTCaCC GCTTCCGCCa GTACCTGAAG 1140 

GCTTTCCTGG CTGCCGTGCT TGGATGGCAC CTGGCACCTG GCACTGCCCA GGCCTCRTTA 1200 

TCCAGCIGTT CTGAGAGCAG CaTACTTACT QCCCAAGRGG AA&TGACTGQ CATGAATGAC 1260 

CTTGGAGAGA GGCAGTCTGA GAACTACCCT AACAAGGBGG ATGTGGGGAA TAAATCSGCC 1320 

TGftGTGACCA AATTTTGGTC TGGTGGGAAC AGRTGGGAAC CAGCTCAATT GGGTGTCCAC 1380 

TOiAAGTGCT CTCTCCS«3GG GCCTCaGTQA CTGTGTTGCT AAACCCAGTG GTCAGTTCTC 1440 

AGTTCTCAGC CATCAGCAGC ATTTGCTCGC CCCXSCCTTCT TCCTCCACTT TCTTCACTTG 1500 

CTTCCRGGAT ACCACGCTTT CTTTTCTGAA TTGCTAC3UIT CTTTCTTCCT TCCTTCCTTG 1560 

CTTCCTTCCT TCCTTCCTTC CCTCTCTCCC TCCCTCCCTC CCTCGCTTCT TOCCTTCCTC 1620 

CTTTCCTCCC TTCCTACTTT CCTTCCTTCC TTCTGACAGG GTCTTGCTCT ATTGCTCTGT 1680 

CACCCAGGCT GGAATGCAGT GGCXSAGATCT CCGCTCACTG TAGCCTCCTC CCCCTGGGTT 1740 

GAAGCAATTC TCATGCCTCA GCCTCCCAAG TAGCCAGGAC TATAGGCACC TGCCACCATG 1800 

CCTGGCIAAT TTTTGTATTT TTTTTCTTTC TTTCTTTCTT TTCTTTTTTT TTTTTTTTTG 1860

AGACGGAGTC TCAOTCTTGT TGCCCRGGCT GGACTkACAAT GGCGCGATCT CGGCTCaCTG 1920 

CAACCTCCAC CTCCCGGATT CAAGOGATTC TCCTGCCTCA GCCTCCTGAG TAGCTGGAAC 1980 

TACATGCGCG TGCCACCACG CACAGCTAAT TTTTATAATT TTAGTAGAGA TGGGQTTTCA 2040 

CTGCGTTGGC CAGGATGATC TCGATCTCTT GACCTTGGGA TCCACCCGCC TTGGCCTCCC 2100 

AARGTGCTGG GATTACAGGT GTGAGCCACC ATGCCTGGCC CTAATTTTTG TGTTTTTATT 2160 

AGAAACAGAG TTTCACCATG TTGGCCAGGC TGGAGAATTG CTGTAATAGT TTTCCAACTG 2220 

GCCCCTGTCC TTCCTCTCTC TTGCTCTCCT CCCATCTCAT CTGCACCTAG CAGCCAGAGT 2280 

GATCCTCATA CTCTCGGCCT TTACITCCGC CTCCCTCAGA GCAGCRGCCT GTCAAAACAC 2340 

CAGATTACAA CAAATTTAGT TTAARGGTCT CAATTAGCGT TATTGGCAAT TCTAGAATCA 2400 

GGCAACAGAC TCATTGAATC RGGAACAGAT TCACTCCATA AAATACAGAG AGTGCTGCAA 2460 

TGAGCTGGGT AGAAGAGGTT AGTTTTATAG ACAGGAAGGG GCTGTCAAAG GCAGAAAGAA 2520 

ATGAAGAACA AAAAAAAAGA TTGATTTTTT TTTTTTIGAG ACAGGATCTC ACTCTGTCAT 2580 

CCAGGCTGAA GTCCAATCCC ACAATCATGG CTCACTGCAG CCACCACCIC CTGAGCTCAA 2640

GTGATCCTCC CATCTAAGCC CCCAAGTAGC TAGGACTACA GGAGCACACC ACCACACCTG 2700 

GCTAATTTTT GTATTTTTTG TGGAGACAGG GTCTCAGTAT GTTACCCAGG TTGGACTGGA 2760 
AACCCTTGGC T 

GAGCCACTGC ACRGGGCCAG ATTCATCATT TCAAAGTTAC T 

AGGGTGGTTG ACATCAGTTT TCTTCAGGTT ACTTTTTAAT AATGATTAAA ACGGGGAACT 2940

GGAATTTGGC TGTTAACTCT GTCTCCAGGA GTCTCAAAG6 TCAGATAACA ACTTAGTTTT 3060

GGTTTGAGGA CATGGAACTT TAATAOGACA GATTCCATTT TGGTTTGGTC TGGTCAGCTG 3120 

GGACCTAGTG CAGGAGGTTA GTCCJiAAACA ATGGCCTTCC ATAGTTTTTA CTTAACAAfiC 3180 

CCAACrCCTT ACGGTAATCC TTTAAGGCCT ATGTGATCTG CCCTCACCCT GGCTACACTC 3240 

TTTGCCCTTA TGTCCACCAG CCTCCAGTGC TCCAGRCACA ACTGATCTCA AATACTCCTG 3300 

ACACftTCCCC CTGGGGATTC CTCTTCCTGG ATTGCTCTCC TCTCAGCTGT CTGCTTGGCT 3360 

CACTCrCTCR CCITCTGAGG TCTTAGCTCA GATACCACCT TTCaCTCaCC TGTCCAACCT 3420 

ATTTAACACA AACTGGCCCA TCTCTGACGC TCCTGRTCTT CCTGATGCTC TATTTTTAAA 3480 

TTTTTTCAAT AGTGCTTATT TCTTTCAGAC ATTGTATATG TTTCTCCTAT TTGTGATGTT 3540 

CAGTGGCTAC TGTCCCCACC CSIC3VTTGTCC CAGGTGAATT CCACAAGGTA GGGATCTTGG 3600 

TTAGGCTCAC TGCTOTACCC AACCCCCTAA CACAGGGCCT GQTGTGTAGG TTTTCAGAAA 3660 

GTATCTGTAA AATGCATQGG TGGAGGAGGG AGCATTTTCC TTCTGGCSiCT GCTGGGGAAA 3720 

AAGAACCTAA GGCCCTCCAC TCACCAGGGT TCCaCftTTCT CTGTAGTCCa GCTGGGAAAT 3780 

GCTATTAAAA GGACCATTCT CGTAGCCTTQ ATGGTGOACC CAGCATCTCO CAATCAGGAG 3840 

GGCCTTGAAG TACTCTGCCT AGGGAATGTC CCAGGACCAT ATACAGTCTG CAGASGGAGG 3900 

CAGTGAACAC ACTTGGCCTA GGCCCCATCT CAGGAGCCTT CACATCTCCC TGGGACCTGC 3960 

ACAATCAAGT CCfiAAATCTT TGTCTGATGT CAAGQCTAGG GCITTTGCAA ATAGGATCCC 4020 

ATACaTCTTC CAGCCCTCAT ACTCCTCCAC TCCCTTTTCA TTTCCTGAAC ATTOCAACIT 4080

CTATGCCTTT GCTCa'TGTGG GCTCCCTCAT GTAATTAGGA TAATTTCCAG CXS«3GTATTC 4140

CCrCaCCCTT GTTTTACftAC TGAATCCCAC TTTCICTGGG ACTCCACTGC TTGGTCACTC 4200 

TGCCCTGTGC TTTGAGAGGA GGTTGGGTGG AGCATGGGAC ATGGGCCATT TACTGCXTTC 4260 

ACATTTCCCT GGTCACG6TG ATTTTTTTA6 GGGTGGGCaC ATGACCTAGG TASAGCTAAT 4320 

T ATCACIAGAA CATGTGRGAA GAGAGGCATG AAACCGAAGC TGAGAGGGTT 4380 

G AGCTACTCCI TCCTCCRCCC AAACTTCAGA AGGTGAAGGA G&TAGTGCAT 4440 

GTGAGTGAGC AGAT0CC6AG GGCACTGTTT GAGCCTCAGA ATCAAAGCAC TTCTGAAGCT 4500 
AGTCCCAATC TTGGATTCGA CAGTTACAAG AACCAATAAR TTCCTTTCCT TGTGG 

Seq ID NO: 129 Protein sequence

1 11 21 31 41 51 

^ WSFGKVFLP VFySLIFVLG 60 

CSVAWHW VFGSFLCJOJV 120 

L DKYLEIVHAQ PYHRLRTRAK SLLLATIVWA VSLAV6IPDM 180 

VFVQTHENPK GVWHCHADFG GHGTIWKLFL RFQQNLLGFL LPLLAMIFFY SRIGCVLVSL 240 

EPAGQGRALK lAAALWAFF VLKFPVNLTL FLHTLLDLQV FGNCEVSQHL DYALQVTESI 300 

I LYAFSSHRFR QYLKAPLAAV LGWHLAPGTA QASLSSCSE6 SILTAQEEMT 360 




AGGCGGACAA AGCCCGATTG T 

AGCCCGGGGC AGGGGCGGGG GCCAGTGTGG TGACACACGC TGTAGCTGTC T 

GCTGGCTCGC TCTCTCCTGG GGACACAGAG GTGGGCAGGC AGCACXCAGA GGGACCTACX3 180 

GGCAGCTGTT CCTTCCCCCG ACTCAACiAAT CCCCGGAGGC CCGGAGGCCT GCAGCAGGAG 240 

CGQCCATGAA GAAGCTGATG GTGGTGCTGA GTCTGATTGC TGCAGCCTGG GCAGAGGAGC 300 

AGAATAAGTT GGTGCATGGC GGACCCTGOS ACAAGACaTC TCACCCCTAC CAAGCTGCCC 360 

TCTACACCTC GGGCCACTTG CTCTGTGGTG GGGTCCTTAT CCATCCACTG TGGGTCCTCA 420 

CAGCTGCCCA CTGCAAAAAA CCGAATCTTC AGGTCTTCCT GGGQAAGCAT AACCTTCGGC 480 

AAAGGGAGAG TTCCTAGGBG CAGAGTTCTG TTGTCOGGGC TCTTGATCCAC CCTGACTATG 540 

ATGCCGCCAG CCATGACCaG GACATCATGC TGTTGOKCT GGCAOGCCCA GCCRAACTOT 600 

CTGAACICAT CCAGCCCCTT CCCCTGGAGA GGGACTGCTC AGCCAACACC ACCAGCTGCC 660 

ACATCCTGGG CTGGGGCAAG ACAGC^GATG GTGATTTCCC TGACACCATC CAGTGTGCAT 720 

ACSVTCCaCCT GGTGTCCCGT GfiGGAGTGTG AGCATGOCTA CCCTGGCCflG ATCACCCAGA 780 

ACaTGTTQTG TGCTGQGGAT GRGAAGTAOQ GGAAGGATTC CTGCCAGGGT GATTCTGGGG 840 

GTCCGCTGGT ATGTGGAGAC CaCCTCCGAG GCCTTGTGTC ATGGGGTAAC ATCCCCIGTG 900 

GATCAAAGGA GAAGCCAGGA GTCTACACCa ACGTCTGCAG ATACaCGAAC TGGATOCAAA 960 

AAACCATTCA GGCCAAGTGA CCCTGACATG TGACATCTAC CTCCCGACCT AOCACCCCAC 1020 

TGGCTGGTTC CAGAACGTCT CTCACCTAGA CCTTGCCTCC CCTCCTCTCC TGCCCAGCTC 1080 

TGACCCTGAT GCTTAATAAA CGCAGCGAOG TGAGGGTCCT GATTCTCCCT GGTT7TACCC 1140 

CAGCTCCATC CTTGCATCAC TGGGGAGGAC GTGATGAGTG AGGACTTGGG TCCTCGGTCT 1200 

TACCCCCACC ACTAAGAGAA TACAGGAAAA TCCCTTCTAG GCATCTCCIC TCCCCAACCC 1260 

TTCCACACGT TTGATTTCTT CCTGCAGAGG CCCAGCCACG TGTCTGGAAT CCCAGCTCOG 1320 

CTGCTTACTG TCGGTGTCCC CTTGGGATGT ACCTTTCTTC ACTGCAGATT TCTCACCTGT 1380 

AAGATGAAGA TAAGGATGAT ACAGTCTCCA TCXGGCSGTG GCTGTTGGAA AGATTTAAGA 1440 

TTTCACACCT ATGACATACA TGGGATAGCA CCTGGGCCGC CATGCACTCA ATAAAGAATG 1500 
TATTTT 

I r r r i' r 

MKiCLMWLSL lAAAWAEEQN KLVHGGPCDK TSHPYQAALY TSGHLLCGGV LIHPLWVLTA 60 

AHCKKFNLQV FLGKKNLRQR BSSQBQSSW RAVIHPDYCA ASHDQDIMLL RLASPAKLSE 120 

LIQPLPLERD CSANTTSCHI LGWGKTADGD FPDTIQCAYI HLVSREECEH AYPGQITQNM 180 

LCAGDEKYGK DSCQGDSGGP LVCGDHLRGL VSWGHIPCGS KEKPGVYmV CRYTNWIQKT 240 

Seq ID NO: 132 DNA sequence 



I I I I I I 

ATGAGCAATC AGIACCAGGA GGAGGGCTGC TCCGAGAGGC CCGAGTGCAA AAGTAAATCT 
CCAACTTTGC TCTOCTCCTA CTGCATCGAC AGCATCTrGG GCCX3GAGGAG CKCGTGCAAA 
ATGOKTTGC TGGGAGCCGC GCAGAGCITG CCTGCTCOGC TGACCAGCCG CGCCGACCCG 
GAAAAGGCCG TGCAAGGCTC CCCTAAGAGC AGCAGCGCCC CGTTCGAGGC CGAGCTGCAC 
CTGCCGCCCA AGCTGCGGOG CCTGTAOGGC CCGGGCGGGG GCCGCCTCCT TCAGGGTGCG 



C GCCGCCACCG CCAACCGCGC GGCCOGGGGA ACGGCOGGAC 



GGCGCAGGGG CaSCCGOSGC AGCCGOGGCC GCGGCCGCCG CGGCCTGGGA CACGCTCAAG 
ATCAGCCAGG CQCCGCAGGT GAGCATCAGC CGCAGCAAGT CGTACCQCGA GAACaOGGOS 

CCCTTCGTGC 03CCGCCGCC CGCX3CTGGAC GAGCTGGGCG GCCCGGGGGG CXSTCACGCaC 
CCGGAGGAGC GCCTCGGCGT GGCCGGCGGC CCGGGCAGCG CCCCGGCTGC GGGTGGTGGC 
ACCGGCACCG AGGACGACGA GGAGGAGCTG CTGGAGGACG AAGAAGATGA GGACGAGQAA 
GAGGAACTGC TGGAGGACGA CGAGGAGGAG CTGCTGGAGG ACGACGCCO; CX3CGCTGCTC 
AAGGAGCCCC GGCGCTGTCC TGTGGCCGCC ACTGGCGCOG TGGCCGCAGC AGCTGCCGCT 
GCAGTGGCCA CAGAGGGCGG GGAGCTGTCA CCCAAGGAGG AGCTGCTGCT GCACCOGQAA 
GACGCTGAGG GCAAGGACGG CGAGGACAGC GTGTGCCTCT CTGCGGGCAG CGACTOGGAG 
GAGGGGCTGC TGAAACGCAA ACAGAGGCGC TACCGCACCA CGTTCACCAG CTACCAGCTG



GAGGAACTGG MCGGGCCTT CCAGAACiACG C&CTACCOGG ACX3TCTTCAC CAGCGAGGAA 
CTGGCTATGA GGCTGGACIT GftCCGftGGCC CGAGTCCaGQ TCTGGITCCa GAACCGTCGQ 
GCCAAGTGGC GCAAGCGGGA GAflGGCAGGC GCGCAGACCC ACCKCCCTGG GCTGCCCTTC 
CCGOGGCCGC TCTCCGCCftC CCACCCGCTC A6CCCCTACC TGGAOSCCAG CCCCTTCCCT 
CCGCACCACC OGGaSCTOSA CTCCGCTTGG ACTGCCGCTG COGCCGCOGC CGCCGCOSCC 
TTCCXX3RGCC TACCTCCGCC TCCGGGCTCG GCCSGCCIGC CGCKCaGCGG GGCGCCGCTG 
GGCCTGAGCA CTTTCCTCGG AGCGGCAGTG TTCXXIACaCC CAGCTTTCAT CAGCCOSGCa 
TTOGGCRGGC TCTTTTCCAC AATGGCCCCC CTGACCAGCG OSTCGACOGC GGCOGCGCTC 
CTGRGRCAGC CCACACCCGC CX3TGGRGGGC GCAGTGGCAT CGGGCGCCCI GGCCGACOCG 
ISIGACRG ACGCGCCTCT AGCSTAGCCG CGCTGBGGOT CABGGCCAAG 
3 CGCRGCTCAC GCAGCTCAAC ATCCTGCCGG GCACCAGCAC GGGCAAGGAG 
. GTGTGC 

Seq ID NO: 133 Protein sequence 



MRLLGAAQSL PAPLTSRADP 
PGGGRLLQGA AAAAAAAAAA AAJiAATATAG 
GAGAAAAAAA AAAAAWDTLK ISQAPQVSIS RSKSYRENGA 
ELGGPGGVTH PEERLGVAGG PGSAPAAGGG TGTEDDEEEL L 



DAEGKDGEDS VCLSAGSDSE EGLLKRKQRE YRTTFTSYQL EELERAFQKT H 

A RVQVWFQNRR AKWRKREKAG AQTHPPGLPF PGPLSATHPL SPYLDASPFP 
iAAAAAA FPSLPPPPGS ASLPPSGAPL GLSTFLGAAV FHHPAFISPA 
? LTSASTAAAL LRQPTPAVEG AVASGALADP ATAAADSRAS SIAALRLKAK 

EHAAQLTQLN ILPGTSTGKE VC 

Seq ID NO: 134 DNA sequence 
Nucleic Acid Accession #: CAT cluster

1 11 21 31 41 51 

I I I I I I 

TTTTTTTAAA GCAGATCATC TCTCCAAATC ATCACTTCTA TCaAQCCTAT 

TGCTTGAGCA GTGTTATAGC ACTCAGCCCT CAGGGCAAAG ATAAGTCTTC ACCATTGTCA 
CACGTAGCAC ACACATATTC AGCCATATCA TGCTGAATGG GAATACAGGA CTTTGTAGAA

ACAGAACTGA TTCCTGCAGA ATATCCTGAG ATACTTATCA A 

GTCTTTTGTC TGTATTGCCC TTGACACCTC CTCAAGGAAA G 

TTCTGAAGAA CCCTCAGACC TCTTAGGTCT AATGTAGGTT AAGTGCCCIG CAGATCTCCC 

TAGAATAGAA AAGCACCITG AAAACTGTAG TCTGACTTAA TAGACACAAA TATAATGAAA 
GCACTAATTC ATAAGATCCT GTTATTTGAA GGA7UVAAGCA GCAAAAGGCA CAAGCTTCAG

ATATTGGTCT TGCACAGGAA AAAGCTGGAA TTCTACC 



CGACGGGTCA TCACGTCGCG CATCGTGGGT GGftGAGGACG CCGAACTCGG GOGTTGGOCG 180

TGGCAQQGGA GCCTGCGCCT GTGGGATTCC CACGTATGCG GAGTQAGCCT GCTCAGCCAC 240 

CGCTGGGCAC TCACGGCGGC GCACTGCTTT GAAACCTATA GTGACCTTAG TGATCCCTCC 300 

GGGTGGATGG TCCAGTTTGG CCAGCTGACT TCCATGCCAT CCTTCTGSAG CCTGCAGGCC 360 

: TATCTGAGCC CTCGCTACCT GGGGAATTCA 420 



ACATCARAGA GGATGAGGCA CTGCCATCTC CCCACACCCT CCAGGAAGTT 
CAGTTTCCGC 



CCCTCCTGGC CGCTACICTT TTTCCCTCTT C 

XTGGGG CCACTGCCAA GTCAGGCCCT GGTTCTCTTC 
ACATTCCAGT TGATGCCTTG CAGGGCATTC TTCAAAA 



I I I I I 

"IRAGLRK PESQEAAPLS GPCGRRVITS RIVGGEDAEL GRWPWQGSLR 

LWDSHVCGVS LLSHRWALTA AHCFETYSDL SDPSGWHVQF GQLTSMPSFW SLQAYYTRYF 
VSNIYLSPRY LGNSPYDIAL VKLSAPVTYT KHIQPICLQA STFEFENRTD CWVTGWGYIK 
EDEALPSPHT LQEVQVAIIN NSMOIHLFLK YSFRKDIFGD MVCAGNAQGG KDACEGDSGG 
PLACNKNGLW YQIGWSWGV GOSIFHRPGV YTNISHHFEH IQKLMAQSGM SQPDPSWPLL 
FFPLLWALPL LGPV 



Seq ID NO: 137 DNA sequence



ATGGGCGCGC GCGGGGOGCT GCTGCTGGCG CTGCTGCTGG CTCGGGCTGG ACTCAGGAAG 
CCGGAGTCGC AGGAGGCGGC GCCCTTATCA GGACCATGCG GOCGACGGGT CATCACGTOG 
CGCATCGTGG GTGGAGAGGA CGCCGAACTC GGGCGTTGGC CGlGGCaGGG GAGOCTGCGC
CTGTC3GGATT CCCACGTATG OGGAGTGftGC CTGCTCAGCC ACOSCTGGGC ACTCaCGGCG
GCGCRCTGCT TTGAAftCTGA CCTTAGTGAT CCCTCXBGGI GGATGGTCCA GTTTGGCC3W3 
CTGACTTCCA TGCCATCCTT CTGGAGCCTG CAGGOCTACI ACACCCGTTA CTTCGTATCG 
AATATCTATC TGAQCCCTCG CTACCTGaOG AATTCaCCCT ATGACftlTGC CTTGGTGAAG 
CTGTCTGCAC CTGTCACCTA CACTAAACAC ATCCfiGCCCA TCTCTCTCCA GGCCTCCACA 
TTTGftGTTTG AtSAACCGGAC AGACTGCIGG GTGACIGGCr GGGGGTACAT CAAAGAGGAT 
GAGGCSVCTGC CATCTCCCCA CACCCTCCAG GAAGTTCAGG TCQCCATCAT AAACAACTCT 
ATGTGCAACC ACCTCTTCCT CAAGTAC3VGT TTCCGCAAGG ACaTCTTTGG AGACaTGGTT 
TGTGCTGGCA ATGCCCAAGG CGGGAAGGAT GCCTGCTTCG GTGACTCAGG TGGACCCTTG 
GCCTGTAACA AGAATGGACT GTGGTATCAG ATTGGAGTCG TGAGCTGGG6 AGTGGGCTGT 
GGTCGGCCCA ATOGGCCCGG TGTCTACACC AATATCAGCC ACCACTTTGA GTGGATCCfiG 
AAGCTGATGG CCCAGAGTGG CATGTCCCAG CCAGACCCCT CCTGGCCACT ACTCITTTTC 
CCTCTTCTCT GGGCTCTCCC ACTCCTGGGG CCGGTCTGA 



■g. PESQBAAPLS GPCGRRVITS RIVGGEDAEL GRWPWQGSLR 
LWDSHVCGVS LLSHRWALTA AHCFETDLSD PSGWMVQFGQ LTSMPSFWSL QAYYTRYFVS 
NIYLSPRYLG NSPYDIALVK LSAPVTYTKH IQPICXQAST FEFEHHTDCW VTGWGYIKED 
EALPSPHTLQ BVQVAIINNS MCNHLFLKYS FRKDIEGDMV CACSIAQGGKD ACFGDSGGPL 
ACKKNGLWYQ IGWSWGVGC GRPNEPGVYT NISHHFEWIQ KLMAQSGMSQ PDPSMPLLFF 



I I I I I 



TCGCCGCCGG GACCCGGGTG CCTGGGCTCX3 GCTTGRAGOS GCGGCGGCX3C ACCGGCACSG 
OXSCGGGAGC AIGGGCAGGA GGATGCGGGG CGCCGCCGCC ACCGCGGGGC ICTGGCTGCT 
GGCGCTGGGC TCXKTGCTGG CGCIGTGGGG AGGGCTCCTG CCGCCGCXK3A CCGAGCTGCC 
CGCCTCCCGG CCGCCOSAAG ACCGACTCCC ACGGCGCCCG GCCCGGflGCG GCGGCCCCGC 
GCCOGCGCCT CGCTTCCCTC TGCCCCCGCC CCTGGCX3TGG GACGCCOSCX: GCGGCTCCCT 
GAAAACTITC CGGGCGCTGC TCACCCTGGC GGCCGGCGCG GACGGCCCX3C CCCGGCAGIC 
CCGGAGCGAG CCCAGSIGGC ACGIGTCAGC CAGGCAGCCC CGGCCGGAGG AGAGCGCCGC 
GCTGCACGGG GGCGTCTTCT GGAGCCGCGG CCTGGBGGAG CftGCTGCCCC CGGGCTTTIC 
GGAGGCCCAG GOGGCGGCGT GGCTGGAGGC GGCTOGCGGC GCCCGGATGG IGGCCCTGGA 
GCGOGGGGGT TGOGGGCGCA GCTCCAACCG ACTGGCCCGT TTIGCCX»CG GCACCCGCGC 
CTGQGTGCGC TACX3GCATCA ACCCGGAGCA GATTCAGGGC GAGGCCCTGT CTTACTATCT 
GGCGCGCCTG CIGGGCCTCC AGCGCCACX3T GCCGCCGCTG GCaCTGGCTC GGGTGGAGGC 
TCGGGGCGCG CAGTGGGCGC AGGIGCAGGA GGAGCTGCGC GCIGCG»CT GGACCGAGGG 
CAGOGTGGTG AGCCTGACAC GCTGGCTGCC CAACCTCACG GACGTGGTGG TGCCCGCGCC 
CTGGCGCTCG GAGGACGGCC GTCTGCGCCC CCTCOGGGAT GCCGGGGGTG AGCTGGCCAA 
CCTCAGCCAG GCGGAGCTGG TGGACCTAGT ACAATGGACC GACTTAATCC TTTTCGACTA 
CCTGACGGCC AACTTCGACC GGCTCGTAAG CAACCTCTTC AGCCTGCAGT GGGACCOGCG 
CGTCATGCAG CGTGCCACCA GCAACCTGCA CCGCGGTCCG GGCGGGGCGC TGGTCTTTCT 
GQACAATGAG GCGGGCTTGG TGCACGGCTA CCGGGTAGCA GGCATGTGGG ACAAGTATAA 
CGAGCCGCTG TTGCAGTCAG TGTGCGTGTT CCGCGAGCGG ACCGCGCGGC GCGTCCTGGfl 
GCTGCACCGC GGACAGGACG CCGCGGCCCG GCTGCTGCGC CTCTACOGGC GCCACGAGCC 
TCGCITCCCC GAGCTGGCCG CCCTTGCAGA CCCCCACGCT CAGCTGCTAC AGCGCCGCCT 
Z GCCAAGCACA TTTTGCACTG TAAGGCCAAG TACGGCCGCC GGTCTGGGAC 



GGGCGGTCGC CTCTGCCACT GTCAGGGACC AGCCSGCCAA CGCCCACCCG CAAAGGTGTC 
TAAAAACTTC AGCTTTTCAC CCACCTGCCC CTTTCTTTCA ATCCCACGCT GTTTCCTTTC 
AAAGTTCTGG GAGGACGAAC TCACCGAGGC GAGAfiGTGTA ACATTCTCTC CACCCAGCIT 
ATAAAAGGAT TCTTTACTGT GCCAGCACGG GGATIGGATC CGAAGAAACT GGCTACTGGG 
GTTTGGCCCC CGAGTGGCCG TCCCTGTGGG AGATGCACCC CATTCTTGGG CCCCCCTCAT 
TCCCTTTCCG AAAAAGGAAA ACTTGCGTTT GAGCCGTTGA GCTAATTCTG CAATTTTCTA 
CCAAACAGAG CGCTGGTGGC CCCGCAGCAG GGCTGTGACA TTGGCTGGTG GAGOCCCTTC 
CTGIGTTCTC CCTTTGTTCC AGCXSCCGCGA TGGTGAGATC ACTGTTCCAA GCAGGGGGAC 



Seq ID NO: 140 Protein sequence



1 11 21 31 41 51 

I I I I I I 

MGRRMRGAAA TAGLWLLALG SLLALWGGLL PPRTBLPASR PPEDRLPKRP ARSGGPAPAP 
RFPLPPPLAW DAEGGSLKTF EALLTLAAGA DGPPHQSRSE PRWHVSAEQP RPEESAAVHG 
GVFWSRGLEE QVPPGFSEAQ AAAWLEAAEG ARMVftliERGG CBRSSNRLAR FADGTRACVR 
YGINPEQIQG EALSYYLARL LGLQRHVPPL ALARVEARGA QWAQVQEELR AAHWTEGSW 
SLTRWLPNLT DVWPAPWRS EDGRLRPLRD AGGELANLSQ AELVDLVQWT DLILFDYLTA 
NFDRLVSNLF SLQWDPRVMQ RATSNLHRGP GGALVFLDNE AGLVHGYRVA GMWDKYNEPL 
LQSVCVFRER TAERVLELHR GQDAAARLLR LYRRHEPHFP ELAALBDPHA QLLQRRLDFL 
AKHIIflCKAK YGRRSGT 

Seq ID NO: 141 DNA sequence

GTCCGCCftAG ATGCCTGCCC CAGTCCCTCI GCTCTCTGCA GCCCAGCCTT C»CX:CICTTG
CaTGGGGAAA TCGTCTACAG ACTTCTGCCX: ATGGCTGTAT GTACGCAOSG ACAGAGCTAG 
CAATGACCCT GCTGGCAG6T ATGATAGGRA GCCTCTTCTG TCACAGGATT CXTTCCCAGR 
C3tflTGRCCCC AAATGCCTGT CCCTGCTCCC CCCACCCCCC AACSTCAAAA TAGCCGAGAA 
AAATGCCCTT CTCGGAGCCA AATGTGTGGT GATGCCTTAC AATCAGAAAT TCCTACAGTG 
GCCTGAGGCT TCCACCACTA AACGCAAAGC TGTAGATACC TATTGCTTGG ATTATAAGCC 
TTCCAAGGGA AGAAGGTGGG CTGCAAGAGC ACCAAGCACC AGAATCACAT ATGGGRCTAT 
CACCAAAGAG AGAGACTACT GCGCGGAAGA CCAGACTATC GAGftGCTGGA GAGAAGAAGG 
TTTCCCAGTG GGCTTGAAGC TTGCTGTGCT TGGTATTTTC ATCATTGTGG TGTTTGTCTA 
CCTGACTGTG GAAAATAAGT CGCTGTTTGG TTAAGTAATT TAGG 



n AQPSPSCMGK SSTDFCPWLY VRTDRASNDP AGRYDRKPLL SQDSFPDNDP 
KCLSLLPPPP NIKIAEKNAL LGAKCWMPY NQKFLQWPEA STTKRKAVDT YOiDYKPSKG 

HRWAAHAPST RITYGTITKE RDYCREDQTI ESWREEGPPV GLKLAVLGIF I 

ENKSLFG 



I I I I I I 

GATTCTACCA TCAGAAAAGA GGCCAAACTT CTATCATCAT GGTGGATGTG AAGTGTCTGA 
GTGACTGTAA ATTGCAGAAC CAACTTGAGA AGCTTGGATT TTCACCTGGC CCAATACTAC 
CTTCCACCAS AAAGTTGTAT GAAAAAAAGT TAGTACAGTT GTTGGTCTCA CCTCCCTGTG 
CACCACCTGT GATGAATGGA CCCAGAGAGC TGGATGGAGC GCAGGACAGT GATGACAGCG 
AAGGTGGGCT GCAAGAGCAC CAAGCACCAG AATCACATAT GGGftCTATCA CCAAAGAGAG 
AGACTACTGC GCGGAAGACC AGACIATOGA GRGCTGQAGA GAAGAAGGTT TCCCAGTGGG 
CTTGAAGCTT GCTGTGCTTG GTATTTTCAT CATTGTGGTG TTTGTCTACC TGACTGTGGA 
AAATAAGTCG CTGTTTGGTT AAGTAATTTA GGAGCAAAGC AATGCTCCaA GCGAGGCCTC 
CIGCTTCAGG AAAGAACCAA AACACTACCC TQAAGGGCCA GCCIAGCCTG CAGCCCTCCC 
TIGCAGGGAG CCTTCCCITG CAOTGTGCTG CTCTCACAGA TCGGTGTCTG GGCTCAGCCA 
GGTGGRAGGR ACCTGCCTAA CCAGGCAOCT GTGTTAAGAG CaTGATGGTT AGGAAATCCC 
CCAAGICATG TCAACTCICA TTAAAGGTGC TTCCATATTT GAGCAGGCGT CSAAC 



Seq ID NO: 145 DNA sequence

Nucleic Acid Accession #: NM_002204.l 

Coding sequence: 74.. 3229

1 11 21 31 41 51 

aggtgaacag gtcctcacgc ccagctcosc cccctcacgc gctctcgccg ggaccccgct 
;gcgccc cgcgccccac gcctgatgct 
; cggctgcgtc gtctccgcct tcaacctgga 



TCACATTATT GAGGACATGT GGCTTGGAGT GACTGTGGCC AGCCAGGGCC CTGCAGGCAG 480 

AGTTCTGGTC TGTGCCCACC GCTACACCCA GGTGCTGTGG 7CAGGGTCAG AAGACCAGCG 540 

GCGCATGGTG GGCAAGTGCT ACGTGCGAGG CAATGACCTA GAGCTGGACT CCAGTGATGA 600 

CTGGCAGACC TACCACAACG AGATGTGCAA TAGCAACACA GACTACCTGG AGACGGGCAT 660 

GTGCCAGCTG GGCACCAGCG GTGGCTTCAC CCAGAACACT GTGTACTTCG GCGCCCCCGG 720 

TGCCTACAAC TGGAAAGGAA ACAGCTACAT GATTCAGCGC AAGGAGTGGG ACTTATCTGA 780 

GTATAGTTAC AAGGACCCAG AGGACCAAGG AAACCTCTAT ATTGGGTACA CGATGCAGGT 840 

AGQCAGCTTC ATCCTGCACC CCAAAAACAT CACCATTGTG ACAGQTGCCC CACGGCACOG 900 

ACATATGGGC GOGGTGTTCT TGCTGAGCCA GGAGGCAGGC GGAGACCTGC GGAGGAGGCA 960 

GGTGCTGGAG GGCTCGCAGG TGGGCGCCTA TTTTGGCAGC GCAATTGCCC TGGOiGACCT 1020 

GAACAATGAT GGGTGOCAGG ACCTCCIGGT GGGOGCCCCC TACTACTTCG AGAGOAAAGA 1080 

GGAAGTAGGG GGTGCCRTCT ATGTCTTCAT GAACCAGGCG GGAACOTCCT TCCOTGCTCA 1140 

CCCCTCACTC CTTCTTCATG GCCCCAGTGG CTCTGCCTTT GGTTTATCTG TGGCCAGOiT 1200 

TGGTGACATC AflCCAGGATG GATTTCAGGA TATTGCTGTG GGAGCTCCGT TTGAAGQCTT 1260 

GG6CAAAGTG TACATCTATC ACAGTAGCTC TAAGGGGCTC CTTAGACAGC CCCAGCAGGT 1320 

AATCCATGGA GAGAAGCTGG GACTGCCTGG GTTGGCCACC TTCGGCTATT CCCTCAGTGG 1380 

GCftfiATGGRT GTGGATGAGA ACTTCTACCC AGACCTTCTA GTGGGAAGCC TGTCAGACCA 1440 

CATTGTGCTG CTGCGGGCCC GGCCAGTCAT C7kACa\T0GTC CACAAGACCT TGGTGCCCaWS 1500 

GCCAGCTGTG CTGGACCCTG CACTTTGCAC GGCCaCCTCT TGTGTGCAAG TGGRGCTGTG 1560 

CTTTGCTTAC AACCAGRGTG COSGGAACCC CAACTACAGG OSAAACATCA CCCTGGCCTA 1620 

CACTCTGGAG GCTGACAGGG ACCGCCGGCC GCCOCGGCTC CGCTTTGCCG GCAGTGAGTC 1680 

CGCTGTCTTC CAOGGCTTCT TCTCCAIGCC CGAGATGOSC TGCCAGAAGC TGGftGCTGCT 1740 

CCTGATGGAC AACCTCCGTG ACAAACICCG CCCCATCATC ATCTCCATGA ACTACTCTTT 1800 
ACCTTTGQGG ATGCCOSATC GCCCCOGGCT GGGGCTGOSG TCCCTGGACG C

CCTCftRCCAG GCACftGGCTC TGGftGAACCA CXCTGAGGTC CAGTTCCAG& AGGAGTGCGG 1920

GCCTGACAAC AAGTGTGRGA GCAACTTGCA GftTGCGGGCa GCCTTCGTGT CAGAGCAGCA 1980 

GCAGRAGCTG AGCAGGCTCC AGTACAGCAG AGACGTCCGG AAATTGCTCC TGAGCATCAA 2040 

CGTGACGAAC ACCCQGACCT CGGAGCGCTC CGGGQAGGAC GCCCACGAGG CGCTGCTCAC 2100 

CCTGGTGGTG CCTCCCGCCC TGCTGCTGTC CTGAGTGCGC CCCCCCGGGG CCTGCCAAGC 2160 

TAATGAGACC ATCTTTTGCG AGCTGGGGAA CCCCTTCAAA CGGAACCAGA GGATGGAGCT 2220 

GCTCOTCGCC TTTGAGGTCA TCGGOQTGAC CCTGCftCACA AGOQACCTrC AGGTGCAGCT 2280 

GCAGCTCTCC ACGTCGAGTC ACCftGGACAA CCTGTGGCCC ATGATCCICA CTCrGCTGGI 2340 

GGACTATACA CTCCAGACOT CGCTTAGCAT GGTAAATCAC CGGCTACaiAA GCTTCTTTGG 2400 

GGGGRCAGTG ATGGGTGAGT CTGGCATGAA AACTGTQGAG GATGTAGGAA GCCCCCTCAA 2460 

GTATGAATTC CRGGTGGGCC CAATGGGGGA GGGGCTGGTG GGCCTGGG6A CCCTGGTCCI 2520 

AGGTCTGGAG TGGCCCTACG AAGTCAGCAA TGGCAAGTGG CTGCTGTATC CCACGOAGM 2580 

CACCGTCCAT QGCAATGGGT CCTGGCCCTG CCGACCACCT QOAGACCTrA TCAACCCTCT 2640 



GCTGGATCCA GC3GGGAGGCC AGGGCCCCCC ACCTGTCACT C 

CAAGTCTGAG ACTGTGCTGA CCTGTGCCAC AGGGCGTGCC CACTGTGTGT GGCTAGAGTG 
CCCCRTCCCT GATGCCCCCG TTGTCACCAA CGTGACTGTG A 
CACCTTCATC GAGGATTACR GAGACTTTGA CCGAGTCCGG G 

ATTCCTCCGA ACCAGCATCC CCACCATCAA CATGGAGAAC AAGACCACGT GGTTCTCTGT 
GGACATTGRC TCGGAGCTGG TGGAGGAGCT GCCGGCCGAA ATOSAGCTGT G 

T GGGGCTGATC ATCCTCCTGC T 

C CCTGTATGAA GCTAAGAGGC AGAAGGCGGA 3180

GATGAAGAGC CAGCCGTCAG AGACAGAGAG GCTGACCGAC GACTACTGAG GGGGCAGCCC 3240 

CCCX3CCCCCG GCCCACCTGG TGTGACTTCT TTAAGCGGAC CCGCTATTAT CAGATCATGC 3300

CCAAGTACCA CGCAGTGCGG ATCCGGGAGG AGGAGCGCTA CCCACCTCCA GGGAGCACCC 3360

TGCCCACCAA GAAGCACTGG GTGACCAGCT GGCAGACTCG GGACCAATAC TACTGACGTC 3420

CTCCCTGATC CCACCCCCTC CTCCCCCAGT GTCCCCTTTC TTCXTATTTA TCATAAGTTA 3480 

TGCCTCTGAC AGTCCACAGG GGCCACCACC TTTGGCTGST AGCAGOiGGC TCAGGCACAT 3540 
ACACCTCGTC AAGAGCATGC ACATGCTGTC TGGCCCTGGG GATCTTCCCA C 
AGCGCTGTGG ACCTTACAAC GCCGAGTGCA CTGCATTCCT GTGCCCTAGA T

GCCCACTGCT CGTGGACTGT GCTGGTGCAT CACGGATGGT GCATGGGCTC GCCX3TGTCTC 3720

AGCCTCTGCC AGCGCCAGOS CCRAAACAAG CCAAAGAGCC TCCCACCAGA GCCGGGAGGA 3780 

AAAGGCCCCT GCAATGTGGT GACACCTCCC CTTTCACACC TGGATCCATC TTGAGAGOCA 3840 

CAGICACTGG ATTGACTTTG CTGTCAAAAC TACTGACAGG GAGCAGCCCC CGGGCCGCTG 3900 

GCTGGTGGGC CCCCAATTGA CACCCATGCC AGAGAGGTGG GGATCCTGCC TAAGGTTGTC 3960

TACGGGGGCA CTTGGBGGAC CTGGOGTGCT CAGACCCAAC AGCAAAGGAA CTAGAAAGAA 4020 

GGACCCAGAA GGCTTGCTTT CCTGCAICTC TGTGAAGCCT CTCTCCTTGG CCACXGACTG 4080 

AACTCGCAGG GAGTGCAGCA GGAAGGAACA AAGACAGGCA AACGGCAACG TAGCCTGGGC 4140 

TCACTGTGCT GGGGCaTGGC GGGATCCTCX: ACAGAGAGGA GGGGAQCAAT TCTOGACAGA 4200 

CAGATGTTGG GAGGATACAG AGGAGATGCC ACTTCTCACT CACCACTACC AGCCAGCCTC 4260

CAGAAGGCCC CAGAGAGACC CTGCAAGACC ACGGAGGGAG CCX3ACACTTG AATGTAGTAA 4320 

TAGGC3«3GGG GCCKTGCCAC CCCATCCAGC CAGACCCC3«3 CTGAACXATG CGTCAGGGGC 4380 

CTAGAGGTGG AGTTCrTAGC TATCCTTGGC TTTCTGTGCC AGCCTGGCTC TGCCCCTCCX: 4440 

CCATGGGCTG TGTCCTAAGG CCCATTTGAG AAGCIGAGGC TAGTTCCAAA AACCTCTCXT 4500 

GACCCCTGCC TGTTGGCAGC CCACTCCCCA GCCCCAGCCC CTTCCATGGT ACTGTAGCAG 4560

GGGAATTCCC TCCCCCTCCT TGTGCCTTCT TTGTATATAG GCTTCTCACC GCGACCAATA 4620 
AACAGCTCEC AGTTTGT 



Protein Accession #:



sequen 



I I I I I . 

MGPGPSRAPR APRLMIiCALA LMVAAGGCW SAFNLDTREL WKEAGNPGS L 
QTERQQRYLL LAGAPRELAV PDGYTNE 

DMWLGVTVAS QGPAGRVLVC AHRYTQVLWS GSEDaRRMVG KCYVRGNDLE LDSSDDWQTY 
HNEMCNSNTD YLETGMCQLG TSGGFTQNTV YFGAPGAYNVJ KGNSYMIQRK EWDLSEYSYK 
DPEDQGNLYI GYTMQVGSFI LHPKNITIVT GAPRHRHMGA VFLLSQEAGG DLRRRQVLEG 
SQVGAYFGSA lALADLNNDG WQDLLVGAPY YFERKEEVGG AIYVFra^QAG TSFPAHPSLL 
LHGPSGSAFG LSVASIGDIN QDGFQDIAVG A 
KLGLPGLATF GYSLSGQMDV DENFYPDLLV G
DPALCTATSC VQVELCFAYN QSAGNPNYRR N 
GFESMPEMRC QKLELLLMDN LRDKLRPIII SMNYSLPLRM PDRPRLGLRS LDAYPILKQA 
QALENHTEVQ PQKECGPDNK CESNLQMRAA FVSEQQQKLS RLQYSEDVRK LLLSINVTNT 
RTSERSGEDA HEALLTLVVP PALLLSSVRP PGACQANETI FCELGNPFKR NQRMELLIAF

EVIGVTLHTR DLQVQLQLST SSHQDNLWPM ILTLLVDYTL QTSLSMVNHR LQSFFGGTVM
GESGMKTVED VGSPLKYEFQ VGPMGEGLVG LGTLVLGLEW PYBVSKGKWL LYPTEITVHG 
NGSWPCKPPG DLINPLNLTL SDPGDRPSSP QRRRRQLDPG GGQGPPPVTL AAAKKAKSET 
VLTCATGRAH CVWLECPIPD APWTNVTVK ARVWHSTEIE DYRDFDEVRV NGWATLFLRT 
SIPTIKMENK TTWFSVDIDS ELVEELPAEl ELWLVLVAVG AGLLLLGLII LLLWKCGFFK : 
RARTRALYEA KRQKAEMKSQ PSETERLTDD Y

Seq ID NO: 147 DNA sequence
Nucleic Acid Accession #: NM_005501.1
Coding sequence: 74..3274
1 11 21 31 41 51

| | | | | |

AGGTGAACAG GTCCTCACGC CCAGCTCCGC CCXXTTCACGC GCTCrCX3CCX3 GGACCCXXKT 
TCCGCTGGCA GCCATQGGCC CCGGCXKCAG CCGCGOSCCC CX30GCCCCAC GCCTGATGCT 
_ _ CTGTGCGCTC GCCTTGATGG TGGCGGCXX3G CGGCTGCGTC GTCTCCGCCT TCAACCTGGA 
TACCCGATTC CTGGTAGTGA AGGAGGCCQG GAACCCGGGC AGCCTCTTCXS GCTACTOGGT
CGCCCTCCAT CGGCAGACAG AGCGGCAGCA GCGCTACCTG CTCCTGaCTG GTGCCCXXXX3 
GGSGCTCGCT GTGCCCGATG GCTACACCAA CCXSQACTGGT GCTGTGTACC TGTGCCCACT 
CACTGCCCAC AAGGATGACT GTGAGCGGAT GAACATCACA GTGAAAAATG ACCCTGGCCA 
TCACATTATT GAGQACATGT GQCTTGGAGT GACTGTGGCC AGCCAGQGCC CTGCAGGCAG 
C GCTACACCCA GGTGCTGTGG TCAQGGTCaG AAGACCAGOG 
GCGCATGGTG QGCAAGTGCT ACGTGOSAGG CAATGACCTA GAGCTGGACT CCAGTGftlGA
CTGGCAGACC TACCRCAACX3 AGATGTGCftA TAGCAACACA GRCTACCTGG AGACX3GGCAT 
3 GGCACCAGCG GTGGCTTCAC CCAGAACRCT GTGTACTTCG GCGCCCCCGG 
K ACAGCTACAT GATTCAGCGC AAGGAGTGGG A 
^CCAAGG AAACCTCTAT ATTGGGTACft C 

AGGCAGCTTC ATCCTGCACC CCAAAAACAT CACXaTTGTG ACAGGTGCCC CACGGCACCG 900 

C GCGGTGTTCT TGCTGAGCCA GGAGGCAGGC GGAGACCTGC GGAGGAOGCA 960 

3CGCCTA TTTTGGCftGC GCAATTGCCX: TGGCAGACCT 1020 

3 ACCTCCTGGT GGGCGCCCCC TACTACTTCG AGAGGAAAGA 1080 

GGAAGTAGGG GGTGCCATCT ATGTCTTCAT GAACCAGGCG GGAACCTCCT TCCCTGCTCA 1140 

CCCCTCACTC CTTCTTCATG GCCCCAGTGG CTCTGCCTTT GGTTTATCTG TGGCCAGCAT 1200

TGGTGACATC AACCAGGATG GATTTCAGGA TATTGCTGTG GGAGCTCCGT TTGAAGGCTT 1260

GGGCAAAGTG TACATCTATC ACAGTAGCTC TAAGGGGCTC CTTAGACAGC CCCAGCAGGT 1320 

AATCCATGGA GAGAAGCTGG GACTGCCTGG GTTGGCCACC TTCGGCTATT CCCTCAGTGG 1380 

GCAGATGGAT GTGGATGAGA ACTTCTACCC AGACCTTCTA GTGGGAAGCX; TGTCA6RCCA 1440 

CATTGTGCTG CTGCGGGCCC GGCCAGTCAT CAACATCGTC CACAAGACCT TGGTGCCCAG 1500

GCCAGCTGTG CTGGACCCTG CACTTTGCAC GGCCACCTCT TGTGTGCAAG TGGAGCTGTG 1560 

CTTTGCTTAC AACCAGAGTG CCGGGAACCC CAACTACAGG CGAAACATCA CCCTGGCCTA 1620 

CACTCTGGAG GCTGACAGGG ACCGCCQGCC GCCCCGGCTC CGCTTTGCCG GCAGTGAGTC 1680 

CGCTGTCTTC CACGGCTTCT TCTCCATGCC CGAGATGCGC TGCCAGAAGC TG3AGCTGCT 1740 

CCTGATGGAC AACCTCCGTG ACAAACTCCG CCCCATCATC ATCTCCATSA ACTACTCTTT 1800 
G ATGCCCGATC GCCCCCGGCT GGQGCTGCGG TCCCTGGACG CCTACCCGAT 



GCCTGACAAC AAGTGTGAGA GCAACTTGCA GATGCGGGCA G 
GCAGAAGCTG AGCAGGCTCC AGTACAGCAG A_ 
CGTGACGAAC ACCCGGACCT CGGAGCGCTC CGGGGAGGAC GCC 
CCTGGTGGTG CCTCCCGCCC TGCTGCTGTC CTCAGTGCGC CCCCCCGGGG C' 

TAATGAGACC ATCTTTTGCG AGCTQGGGAA CCCCTTCAAA CGGAACCAGA GGATGGAGCT 2220 

GCTCATCGCC TTTGAGGTCA TCGGGGTGAC CCTGCACACA AGGGRCCTTC AGGTGCAGCT 2280 

GCAGCTCTCC AOSTCGAGTC ACCA(3GACAR CCTGTGGCCC ATGATCCTCA CTCTGCTGGT 2340 

GGACTATACA CTCCXGACCT CGCTTAGCAT GQTAAATCAC CGGCTACAAA GCTTCTITGG 2400 

GGGGACAGTG ATGGGTGftGT CTGGCATGAA AACTGTGGAG GATGTAGGAA GCCCCCICaA 2460 

GTATGAATTC CAGGTGGGCC CA&TGGGGGA GGQGCTGGTG GGCCTGGGGA CCCTGGrCCT 2520 

AGGTCTGGAG TGGCXICTACG AAGTCaGCAR TGGCAAGTGG CTGCTGTATC CCACGGRGAT 2580 

CACXX3TCCAT GGCAATGGGT CCTGGCXX:TG CXX3ACCACOT GGAGACXnTA TCaACCXTCT 2640 

CSUVCCTCACT CTTTCTGftCC CTGGGGACAG GCCATCATCC CCACAGCGCA GGCGCOaCA 2700 

GCTGGRTCCA GGGGGAGGCC ftGGGCCCCCX: ACCTGTCACT CTGGCTGCTG CCKAAAAAGC 2760 

CAAGTOTGAG ACTGTGCTGA COTGTGCCAC AGGGCX3TGCC CACTGTGTGT GGCTAGAGTG 2820 

CCCCATCCCT GATGCCCXXX3 TTGTCACCM CGTGACTGTG AAGGCACGRG TGTGGAACAG 2880 

CACCTTCATC GAGGATTACA GAGACTTTGA CCGAGTCCGG GTAAATGGCT GGGCTACCCT 2940 

ATTCCrCXX3A ACCAGCATCC CXACCATCAA CAIGGAGAAC AAGACCACGT GGTTCTCTGT 3000 

GGACATTGAC TCGGAGCTGG TGGAGGAGCT GCCGGCCGAA ATCGAGCTGT GGCTGGTGCT 3060

GGTGGCCGTG GGTGCAGGGC TGCTGCTGCT GGGGCTGATC ATCCTCCTGC TGTGGAAGTG 3120 

TGACTTCTTT AAGCGGACCC GCTATTATCA GATCATGCCC AAGTACCAOG CAGTGOXSAT 3180 

CCGGGAGGAG GAGOSCTACC CACCTCCAGG GAGCACCCTG CCCACCAAGA AGCACTGGGT 3240 

GACCAGCTGG CAGACTCGGG ACCAATACTA CTGAOGTCCT CCCTGATCCC ACCCCCTCCT 3300 

CCCCCAGTGT CCCCTTTCTT CCTATTTATC ATAAGTTATG CCTCIGACAG TCCACAQGGG 3360 

CCACCACCTT TGGCTGGTAG CAGCAGGCTC AGGCACATAC ACCTCGTCAA GAGCATGCAC 3420 

ATGCTGTCTG GCCCTGGGGA TCTTCCCACA GGAGGGCCAG CGCTGTGGAC CTTACAACGC 3480 

CGAGTGCACT GCATTCCTGT GCCCTAGATG CACX3TGGGGC CCACTGCTCG TGGACTGTGC 3540 



C AAAGAGCCTC CCACCAGAGC CGGGAGGAAA AGGCCCCTGC A 

CACCTCCCCT TTCACACCTG GATCCATCTT GAGAGCCACA GTCACTGGAT TGACTTTGCT 3720 

GTCAAAACTA CTGACAGGGA GCAGCCCCCG GGCCGCTGGC TGGTGGGCCC CCAATTGACA 3780 

CCCATGCCAG AGAGGTGGGG ATCCTGCCTA AGGTTGTCTA CGGGGGCACT TGGAGGACCT 3840 

GGCGTGCTCA GACCCAACAG CAAAGGAACT AGAAAGAAGG ACCCAGAAGG CTTGCTTTCC 3900 



A GACAGGCAAA CGGCAACGTA GCCTGGGCTC ACTGIGCTGG © 

C AGAGAGGAGG GGACCAATTC TGGACAGACA GATGTTGGGA GGATACAGAG 4080

C TTCTCACTCA CCACTACCAG CCAGCCTCCA GAAGGCCCCA GAGAGACCCT 4140 

C GGAGGGAGCC GACACTTGAA TGTAGTAATA GGCAGGGGGC CCTGCCACCC 4200 

A GACCCCAGCT GAACCATGCG TCAGGGGCCT AGAGGTGGAG TTCTTAGCTA 4260 

TCCTTGGCTT TCTGTGCCAG CCTGGCTCTG CCCCTCCCCC ATGGGCTGTG TCCTAAGGCC 4320 

CATTTGAGAA GCTGAGGCTA GTTCCSiAAAA CCTCTCCTGA CCCCTGCCTG TTGGCAGCCC 4380 

ACTCCCCAGC CCCAGCCCCT TCCATGQTAC TGTAGCAGOG GAATTCCCTC CCCCTCCTTG 4440 
TGCCTTCTTT GTATATAGGC TTCTCACCGC GRCCARTAAR CAGCTCCCAG TTTGT 

Seq ID NO: 148 Protein sequence
MGPGPSRAPR APRLMLCALA LMVAAGGCW SAFNLDTEFL WKEAGMPGS LFGYSVAUJR
QTERQQRYLL LAGAPREIAV FDGYTHRTGA VYLCPLTAHK DDCEKHNITV KNDPGHHIIE 
DMWLGVTVAS QGPAGRVLVC AHRYTQVLWS GSEDQRRMVG KCYVRGNDLE LDSSDDWQiY 
HNEMCHStmJ YLETGMCQLG TSGGFTQNTV YFGAPGAYHW KGNSYMIQEK EWDLSEYSYK 
DPEDQGHLYI GYTMQVGSFI LHPKNITIVT GAPRHIiHHGA VFLLSQEAGG DLRRRQVLEG 
SQVGAYFGSA lALADtNHDG WQDLLVGAPY YFERKEEVGG AIYVFMNQAG TSFPflHPSLL 
LHGPSGSAFG LSVASIGDIN QDGFQDIAVG APFEGLGKVY lYHSSSKGIiL RQPQQVIHGE 
KLGLPGLATF GYSLSGQMDV DENFYPDLLV GSLSDHIVLL RARPVINIVH KTLVPEPAVL 
DPALCTATSC VQVELCFAYN QSAGNPNYRR NITLAYTLEA DRDERPPRLR EAGSESAVEH 
GFFSMPEMRC QKLELLLMDN LRDKLRPIII SMNYSLPLRM PDRPRLGLRS LDAYPIIiNCJR 
QALENHTEVQ FQKECGPDNK CESNLQMRAA FVSEQQQKLS RLQYSHDVEK LLLSINVTNT 
RTSERSGEDA HEALLTLWP PALLLSSVRP PGACQANETI FCELGNPFKR NQEMELLIAE 
EVIGVTLHTR DLQVQLQLST SSHQDNLWPM ILTLLVDYTL QTSLSMUNHR LQSFFGGTVM 
GESGMKTVED VGSPLKYEFQ VGPMGEGLVG LGTLVLGLEW PYEVSNGKWL LYPTEITVHG 
NGSWECRPPG DLINPUJLTL SDPGDRPSSP QRRRRQLDPG QGQGPPPVTL AAAKKAKSET
VLTCaTGRBH CVWLECPIPD APWTNVTVK REVWNSTFIB DYKDFDRVEV NGWATLFLRT 
SIPTINMENK TTWFSVDIDS ELVEELPftEI ELWLVLVAVG flGLLLLGLII LLLWKCSJFFK 
RTRYYQIMPK TOAVRIREEE RYPPPGSTLP TKKHWVTSWQ TRDQYY 



CGGGCCAGGT TTCCRGGCTC GGCCGCCGCC TCCAICCCftG CRCCTGCGGA GGGAGCX3CIG 60 

ACCMGGCTC OTrGGCCTGA ATTGGGAGAT GCCCAGCTCA ACCXKGATAA GTACCTCX3RA 120 

GGGGOCGCAG GTCAGCMCC CACTGCCCCT GATAAAAGCA AAGRGACCAA OUUiACAGAT 180 

AACACTGRGG CACCTGTAAC CAflGATTGAA CTTCIGCCGT CCIACTCCAC GGCTACACIG 240 

ATAGATGAGC CXSVCTGAGGT GGATGACCCC TGGAACXTTAC CCACTCTTCA GGACTCGGGG 300 

ATCAAGTGGT CAGAGAGAGA CACXMiAAGGG AAGATTCTCT OrTTCTTCCA AGGGATTGGG 360 

AGATTGATTT TACTTCTCGG ATTTCTCTAC TTTTTCGTGT GCTCCCTGGA TATTCTTAGT 420 

AGOSCCTTCC AGCTGGTTGG AGGAAAAATG GCAGGACAGT TCTTCAGCAA CAGCTCTATT 480 

ATGTCCftACC CrTTGTTGGG GCTGGTGATC GGGGTGCTGG TGACCGTCTT GGTGCAGAGC 540 

TCCAGCACCT CAACGTCCAT CGTTGTCAGC ATGCTGTCCT C^CATTGCT CACTGTTCGG 600 

GCTGCCATCC CCXTTATCAT GGGGGCCAAC ATTGGAAOST CMTCACCAA CACTATTQTT 660 

GOGCrCATGC AGGTGGGAGA TOGGAGTGAG TTCAGAAGAG CTTTTGCAGG AGCCaCTGTC 720 

CATGACTTCT TCAACTGGCT GTCCXSTGTTG GTGCTCTTGC aXSTGOAaaT GGCCACCCAT 780 

TACCTCGAGA TCATAACCCA GCTTATAGTG GAGAGCTTCC ACTTCAAGAA TGGAGAAGAT 840 

GCCCCAGATC TTCTGflAAGT CATCACTAAG CCCTTCACaA AGCTCATTGT CCAGCTGGAT 900 

AAAAAAGTTA TCAGCCAAAT TGCAATGAAC (3ATGRAAAAG CGAAAAACAA GAGTCTTCTC 960 

AAGRTTTGGT GCAAAACTTT TACCAACAAG ACCCftGATTA AOJrCftCTGT TCCCTCGACT 1020 

GCTAACTGCA CCTCCCCTTC CCTCTGTTGG ACGGATGGCA TCCAAAACTG GACCATGAAG 1080 

AATGTGACCT ACAAQGAGAA CATCGCCAAA TGCCRGCATA TCTTTGTGAA TTTCCACCTC 1140 

CCGGRTCTTG CIGTGGGCAC CATCTTGCTC ATACTCTCCC TGCTGGTCCT CTGTGGTTGC 1200 

CTGATCATGA TTGTCAAGAT CCTGGGCTCT GTGCTCAAGG GGCAGGTCGC CACTGTCATC 1260

T CCCTTTGCAT GSTTGACrGG CTACXTTGOCC 1320 







TACTGCTGCC GCGTGTGCTG CaSCGCGTGC TGCTTGCTGT GTGGCTGCCC CAAGTGCTGC 1980 

CGCTGCAGCA AGTGCIGCGA G6ACTTGGAG GAGGCGCAGG AGGGGCAGGA TGTCCCTGTC 2040 

AAGGCTCCTG AGACCTTTGA TAACATAACC ATTAGCAGAG AGGCTCAGGG TGAGGTCCCT 2100 

GCCTCGGACT CAAAGACCGA ATGCACGGCC TTGTAGGGGA CGCCCCAGAT TGTCAGGGAT 2160 
MAPWPELGDA QPNPDKYLEG AAGQQPTAPD KSKETNKTDN TEAPVTKIEL LPSYSTATLI
DEPTEVDDPW NLPTLQDSGI KWSERDTKGK ILCFFQGIGR LILLLGFLYF FVCSLDILSS 
AFQLVGGKMA GQFFSNSSIM SNPLLGLVIG VLVTVLVQSS STSTSIWSM VSSSLLTVRA 
AIPIIMGBNI GTSITNTIVA LMQVGDRSEF RRAEAGATVH DEFNWLSVLV LLPVEVATHY 
LEIITQLIVE SFHFKNGEDA PDLLKVITKP FTKLIVQLDK KVISQIAHND EKAKNKSLVK 
IWCaCTFTNKT QINVTVPSTA NCTSPSLCWT DGIQNWTMKH VTYKENIAKC QHIFVNFHLP 
DLAVGTILLI LSLLVLCXSO. IMIVKILGSV LKGQVATVIK KTINTDFPFP FAWLTGYLAI 
LVGAGMTFIV QSSSVFTSAL TPLIGIGVIT lERAYPLTLG SNIGTTTTAI lAALASPGNA 
LRSSLQIALC HFFFNISGIL LWYPIPFTSL PIRMAKGLGH ISAKYRWFAV FYLIIFFFLI 
PLTVFGLSLA GWRVLVGVGV PWFIIILVL CLRLLQSRCP RVLPKKLQNM NFLPLWMRSL 
KPWDAWSKF TGCFQMRCCY CCRVCCRACC LLCGCPKCCR CSKCCEDLEE AQEGQDVPVK 
APETFDNITI SREAQGEVPA S 



ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TQGGCACTAT OGAGGrGAAG 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGGAAAATTT 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 
GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGfl AGAAGCAGAC 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAAOGTATTG 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTIAGA 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAGAAA GGCATCGTAG GGTACGTCTT 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 
GTAACACCAC ATGGCTTAGA AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 
GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTTTTAGA AGITAATGGC 
ATAGAAGTTT CAGQGAAGAG CCTTGATCAA GTAACAGACA TGATGATTGC AAATAGCCGT 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACASICGG 
ACTTCTGGCA GTTCCGGTCA GICTACTGAT AACAGCCTTC TrGGCTAOCC ACAGCAGATT 
GAACCAAGCT TTGRGCCRGA GGATGAAGAC AGCGAftGAAG ATGACATTAT CRTTGAAGAC 900

AATGGAGTGC CAC3SGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT QGAGTCATTA 960 

ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 

AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCITAG AAGAAGATGG AACAATCATA ACATTATGA 



1 11 21 31 41 51

| | | | | |

MNESHRHGAG SGCLGTMEVK SKFGSEFRRF SLEESKEGKF EEFYGLLQHV HKIPNVDVLV 
GXADIHGDLL FINNDDNYHK AVSTANFLLR IFIQKKEEAD YSAFGTDTLI KKKNVLTHVIi 
RPDMHRKKPH IVISMPQDFE FVSSIIDVDI LPETHRRVRL YKYGTEKPLG FYIHDGSSVR 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLBVHG lEVSGKSLDQ VTDMMIflNSR 
NLIITVRPflN QRNNWRNSR TSGSSGQSTD NSLLGYFQQI EFSFEFEDED SEEDDIIIED 
NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 



ACCCAACAAG GAGGRAGCCT GGGRAGTGCC C 
. TGGCCAATGT TTGATGCTTA ACCCCCOCAA TTTCTGTGAG ATGGATGGCC AGTGCAAGCG 

TGACTTGAAG TGTTGCATGG GCATGTGTGG GAAATCCTGC GTTTCCCCTG TGAAAGCTTG
ATTCCTGCCai TATGGAGGAG GCTCTGGAGT CCTGCICTGT GTGGTCCAGG TCOTTTCCAC 
OCTGAGACTT QOCTCCACCA CTGATATCCT CCTTTGGGGA AAGGCTTGGC ACACAGCAGG 
CTTTCAAGAA GTGCCAGTTG ATCAATGAAT AAATAAACGA GCCTATTTCT CTTTGCAC 



D NO: 154 F 
AATTCGGCAC GAGGGCAGGT GCAGGCGCAC GCGGCX3AGAG CGIATGGAGC CGAGCCXSTTA 60

GCGCGCGCCG TCGGTGAGTC AGTCCGTCOG TCCCTCCXSTC CX3ICX3GGGOG CCGCAGCTCX: 120 

CGCCAGGCCC AGCGGCCCCG GCCCCTCGTC TCCCCGCACC CGGAGCCACC CGGTGGAGQG 180

GGCCTTGCCG CGGCAGCCAT GTCCATGGGC CTGGAGATCA CGGGCACCXSC GCTGGCXXSTG 240 

CTGGGCTGGC TGGGCACCAT CGTGTGCTGC GC3GTTGCCCA TGTGGCGOGT GTCGGCCTTC 300 

ATCGGCAGCA ACATCATCAC GTCGCAGAAC ATCTGGGAGG GOCTGTGGAT GAACTGCGTG 360 

GTGCAGAGCA CCGGCCAGAT GCAGTGCAAG GTGTAOGACT OGCTGCTGGC ACTGCCACAG 420 

GACCTTCAGG CGGCCCGCGC CCTCATCGTG GTGGCCATCC TGCTGGCCGC CTTCGGGCTG 480

CTAGTGGCGC TGGTGGGCGC CCaGTGCACC AACTGCGTGC AGGACGACAC GGCCAAGGCC 540 

AAGATCACCA TCGTGGCAGG CGTGCTGTTC CTTOTCGCCG CCCTGCTCAC CCTOSTGCCG 600 

GTGTCCTGGT CGGCCAACAC CATTATCCGG GACTTCTACA ACCCCGTGGT GCCCGAGGCG 660 

CAGAAGCGCG AGATGGGCGC GGGCCTGTAC GTGGGCTGGG CGGCCGCGGC GCTGCAGCTG 720

CTGGGGGGCG CGCTGCTCTG CTGCTCGTGT CCCCCACGCG AGAAGAAGTA CACGGCCACC 780

AAGGTCGTCT ACTCCGCGCC GCGCTCCACC GGCCCGGGRG CCAGCCTGGG CACAGGCTAC 840 

GACCGCAAGG ACTACGTCTA AGGGACAGAC GCAGGGAGAC CCCACCACCA CCACCACCAC 900

: ACCACCACCG CGAGCTGGAG CGCGCACCAG GCCATCCAGC GTGCAGCCTT 960 

3 CCAGCCCACC CCCAGAAGCC AGGAAGCCCC CGCGCTGGAC TGGGGCAGCT 1020 

GCATGGACTG TGAAACCTCA CCCTTCTGGA GCACGGGGCC TGGGTGACCG CCAATACTTG 1140 

1200 



GCCCTGGAAG GGGCACTTGA TATTTTTCAA 



MSMGLEITGT ALAVU3HLGT IVCCALPMWR VSAFIGSNII TSQHIWEGLW MNCWQSIGQ 
MQCKVYDSLL ALPQDLQAAR ALIVVAILLA AFGLLVALVG AQCTNCVQDD TAKAKITIVA
GVLFLLAALL TLVPVSWSAN TIIRDFYHFV VFBAQKREMG AGLYVGWAAA ALQLLGGALL 
CCSCPPREKK YTATKWYSA PRSTGPGASL GTGYDRKDYV 






Seq ID NO: 157 DNA sequence
Nucleic Acid Accession #: NM
Coding sequence: 1..597
AACTTCCAGG ACAACCRATT CCACSGGGAAG TGGTATGIGG TAGGCCTC3GC A

ATTCTCAGAG AAGACftAAGA CCCGCAAAA6 ATGIATGCTA CCATCTATGR GCTGAAAGAA 
GACAAGAGCT ACAATGTCftC CTCCGTCOTG TTTAGGAAAA AGAAGTGTGA CTACTGGATC 
AGGACTTTTG TTCCAGGTTG CCMCCOGGC CiAGTTCAaSC TGCGCRACAT TAAGAGTTAC 
CCTGGATTAA CXSAGTTACCT CGTCCGAOTG GTGAGCACCA ACTACAACCA GCATGCTATG 
GT6TTCTTCA AGAAAGTTTC TCAAAACAGG GAGTACTTCA AGATCACCCT CTACX3GGAGA 
ACC3VAGGAGC TGACTTCGGA ACTAAAGGAG AACTTCATCX: GCTTCTCCAA ATATCTGGGC 
CTCCCTGAAA ACCACRTCGT CTTCCCTGTC CCAATCGACC AGTGTATCGA CQGCTGA 

Seq ID NO: 158 Protein sequence



MPLGLLWLGL ALLGALHAQA QDSTSDLIPA PPLSKVPLQQ NEQDNQEQGK WYVVGLAGNA 
ILREDKDPQK MYATIYELKE DKSYNVTSVL FEKKKCDYWI RTFVPGCQPG EFTLGUIKSY 
PGLTSYLVRV VSTNYNQHAM VFFKKVSQNR EYFKITLYGR TKELTSELKE NFIRFSKYLG 
LPENHIVFPV PIDQCIDG 



AGGAATCTGC GCTCGGGTTC CGCAGATGCA GAGGTTGAQO TGGCTGCGQG ACTGGAAGTC 
3 GGGGAGAGAC 



AGCCCACTGC CTCAAGCCCC GCTACATAGT TCACCTGGGG CAGCACAACC TCCAGAAGGA 360 

GGAGGGCTGT GAGCAGACCC GGACAGCCAC TGAGTCCTTC CCCCACXCCG GCTTCAACAA 420

CAGCCTCCCC AACAAAGACC ACCGCAATGA CATCATGCTG GTGAAGATGG CATCGCCA.GT 480 

CTCCATCACC TGGGCTGTGC GACCCCTCAC CCTCTCCTCA OGCTGTGTCA CTGCTGGCAC 540 

CAGCTGCCIC ATTTCCGGCT GGQGCAGCAC GTCCAGCCCC CAGTTACGCC TGCCTCACfiC 600 

CTTGCGATGC GCCAACATCA CCATCATTGA GCACCAGAAG TGTGAGAACG CCTACCCCGG 660 

CAACATCACA GACACCATGG TGTGTGCCAG CGTGCAGGAA GGGGGCAAGG ACTCCTGCCA 720 

GGGTGACTCC GGGGGCCCTC TGGTCTGTAA CCAGTCTCTT CAAGGCATTA TCTCCTGGGG 780 

CCAGGATCCG TGTGCGATCA CCCGAAAGCC TGGTGTCTAC ACGAAAGTCT GCAAATATGT 840

GGACTGGATC CAGGAGACGA TGAAGAACAA TTAGACTGGR CCXACOaiCC ACAGCCCATC 900 

ACCCTCCATT TCCACTTGGT GTTTGGTTCC TGTTCACrCT GTTAATAAGA AACCXTAAGC 960 

CAAGACCCrC TACGAAC31TT CTTTGGGCOT CCTGGACTAC AGGAGMGOT GTCACTTAAT 1020 

AATCAACCTG GGGTTCGAAR TCAGTGAGAC CTGGATTCAA ATTCTGCCTT GAAATATTGT 1080 

GACTCTGGGA ATGACAACAC CTGGTTTGTT CTCTGTTGTA TCXCCAGCCC CAAAGACAGC 1140 



SITKAVRPLT LSSHCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 180 
NITDIMVCAS VQEGGKDSCQ GDSGGPLVOJ QSLQGIISWG QDPCAITEKP GVYTKVCKYV 240 
DWIQETMKNN 

It is understood that the examples described above in no way serve to limit the true
scope of this invention, but rather are presented for illustrative purposes. All publications,
sequences of accession numbers, and patent applications cited in this specification are herein
incorporated by reference as if each individual publication or patent application were
specifically and individually indicated to be incorporated by reference.
WHAT IS CLAIMED IS:

1. A method of detecting an ovarian cancer-associated transcript in a cell
from a patient, the method comprising contacting a biological sample from the patient with a
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence
as shown in Tables 1-26.

2. The method of claim 1, wherein the biological sample comprises
isolated nucleic acids.

3. The method of claim 2, wherein the nucleic acids are mRNA.

4. The method of claim 2, further comprising the step of amplifying
nucleic acids before the step of contacting the biological sample with the polynucleotide.

5. The method of claim 1, wherein the polynucleotide comprises a
sequence as shown in Tables 1-26.

6. The method of claim 1, wherein the polynucleotide is immobilized on
a solid surface.

7. The method of claim 1, wherein the patient is undergoing a therapeutic
regimen to treat ovarian cancer.

8. The method of claim 1, wherein the patient is suspected of having
ovarian cancer.

9. An isolated nucleic acid molecule consisting of a polynucleotide
sequence as shown in Tables 1-26.

10. The nucleic acid molecule of claim 9, which is labeled.

11. An expression vector comprising the nucleic acid of claim 9.

12. A host cell comprising the expression vector of claim 11.

13. An isolated polypeptide which is encoded by a nucleic acid molecule
having a polynucleotide sequence as shown in Tables 1-26.

14. An antibody that specifically binds a polypeptide of claim 13.

15. The antibody of claim 14, further conjugated to an effector component.

16. The antibody of claim 15, wherein the effector component is a
fluorescent label.

17. The antibody of claim 15, wherein the effector component is a
radioisotope or a cytotoxic chemical.

18. The antibody of claim 15, which is an antibody fragment.

19. The antibody of claim 15, which is a humanized antibody.

20. A method of detecting an ovarian cancer cell in a biological sample
from a patient, the method comprising contacting the biological sample with an antibody of
claim 14.

21. The method of claim 20, wherein the antibody is further conjugated to
an effector component.

22. The method of claim 21, wherein the effector component is a
fluorescent label.

23. A method for identifying a compound that modulates an ovarian
cancer-associated polypeptide, the method comprising the steps of:
(i) contacting the compound with an ovarian cancer-associated polypeptide,
the polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least
80% identical to a sequence as shown in Tables 1-26; and
(ii) determining the functional effect of the compound upon the polypeptide.

24. A drug screening assay comprising the steps of:
(i) administering a test compound to a mammal having ovarian cancer or a cell
isolated therefrom;
(ii) comparing the level of gene expression of a polynucleotide that selectively
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-26 in a
treated cell or mammal with the level of gene expression of the polynucleotide in a control
cell or mammal, wherein a test compound that modulates the level of expression of the
polynucleotide is a candidate for the treatment of ovarian cancer.